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Abstract 


Given a set of equational axioms and two terms containing function symbols and variables, the 
equational unification problem is to find a uniform replacement of terms for the variables that makes 
the terms provably equal from the axioms. In the variable-only case, the two terms contain only 
variables and function symbols from the axioms. In the general case, the terms may contain symbols 
not appearing in the axioms, there may be more than one instance of a set of axioms, and there may 
be more than one set of axioms. 


This thesis presents a method for combining equational unification algorithms to handle the terms 
with "mixed" sets of function symbols. For example, given one algorithm for unifying associative- 
commutative operators, and another for unifying commutative operators, our algorithm provides a 
method for unifying terms containing both kinds of operators. {t is based on a general strategy for 
decomposing terms and combining unifiers. We restrict our attention to sets of axioms whose func- 
tion symbols are pairwise disjoint. 


A simplifying assumption is that we are working only with confined regular equational theories, a class 
of theories defined in this thesis. We present a unification algorithm that solves the general case 
unification problem for any combination of these theories, given variable-only case algorithms for the 
theories. The algorithm is proven totally correct. The termination proof is a generalization of Fages’ 
proof of termination for associative-commutative unification. 


Our algorithm has been implemented as part of a larger system for generating and reasoning about 
equational term rewriting systems. 
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Chapter One 


Introduction 


The unification problems are a class of problems involving a general form of pattern matching. As 
such, they occur independently in many contexts that involve symbol manipulation, the most 
pronounced of these being automated theorem proving. A unification problem can be formulated as 


a problem on strings. graphs. or algebraic objects such as sets or groups. 


The classical unification problem is: given two terms containing function symbols and variables, 
find a uniform replacement of terms for the variables that makes the two terms syntactically identical. 
Equational unification, or E-unification. extends the classical problem to solving an equation in an 
equational theory. That is. given a set of equational axioms, find a substitution for the variables in the 


two terms that makes them provably equal from the set of axioms describing the theory. 


In this thesis, we develop a framework for automatically combining E-unification algorithms for 
independent sets of operators by carefully analyzing the E-unification process in general and study- 
ing a number of equational theories in particular. Our approach is generalized in the following sense: 
Given a unification algorithm for E,, and a unification algorithm for E,, we can in some cases 
automatically generate an algorithm for the combined theory of E, and E., such that the resulting 


algorithm will unify terms with mixed set of operators. 


Unification was first described by Herbrand in 1930, and was first put to practical use by Robinson 
as a basic step in resolution [Robinson 65], an inference rule used as a complete proct system for first 
order predicate calculus. Because of its simplicity and power, the resolution rule is often used as the 
basis for automatic theorem provers and is also exploited in implementing the logic programming 
language Prolog [Kowalski 74, Clocksin 81]. In Prolog, unification acts as a procedure call 
mechanism, allowing procedures to be invoked when the arguments fit the pattern given in the 


procedure head. 


Unification is also used in type inference algorithms for languages such as ML [Milner 78], in 
which type inference is used as a compromise between strictly typed and typeless languages. These 
languages gain expressive power over explicitly typed languages that enforce strong type checking, 
because the type inference provides the programmer with a mechanism for a certain kind of polymor- 


phism [Mitchell 84]. 


Unification is also an important operation in ferm rewriting systems. These systems perform 
reasoning by compiling equations into a set of rewrite rules. This compilation, known as the comple- 
tion process, involves ordering each equation into a directed rule, finding pairs of rules that could 
apply to a single term, and sometimes adding new rules when such critical pairs are found. Unifica- 
tion is used in finding the pairs of overlapping rules and in generating the additional rules. If the 
completion process terminates successfully, the resulting system is called convergent, meaning the 
rules, applied in any order to a given input, will always result in a unique answer [Knuth 70]. A 
convergent rewriting system is a complete and terminating decision procedure for determining 
whether or not an equation is implied by the original set of equations. Term rewriting systems can be 
used as a basis for automatic theorem provers [Huet 82, Kapur 84, Goguen 80, Hsiang 82]. These 
theorem provers have been used for applications such as checking formal specifications [Goguen 
79, Guttag 83, Kownacki 84], interpreting logic programming languages [Dershowitz 88a, Fribourg 
84], reasoning about relational databases [Cosmadakis 85], and checking properties of petri nets 


[Choppy 85]. 


The unification algorithm described in this thesis was initially motivated by the need to extend the 
domain of applications for which the REVE term rewriting system generator [Lescanne 83, Forgaard 
84a] is useful. Because unification problems occur in many different applications, there are both 
theoretical and pragmatic reasons for developing a better understanding of the problem and its 
solutions. A condensed version of the work described in this thesis appeared previously in [Yelick 


85]. 


1.1 Organization of the Thesis 


The remainder of this chapter is devoted to giving the background necessary for understanding 
unification, both classical and equational, and discussing related work in the field. The rest of the 
thesis is divided into three chapters. Chapter 2 presents our generalization of the problem along with 
some some restrictions, gives our algorithm, and goes through a non-trivial example. Chapter 
3 presents a proof of total correctness for the generalized algorithm. Chapter 4 presents a summary 
of our conclusions, a description of the implementation, and ideas for future work. The reader can 
get of clear picture of our approach, ignoring the question of why the algorithm works, by reading 
Chapters 2 and 4. Three appendices are included. Appendix A gives some additional technical 
details of the proof of correctness, Appendix B gives a glossary of some terms used in this thesis, and 


Appendix C gives a list of special symbols and there uses. 


1.2 Definitions 


The following definitions are consistent with the definitions of [Fages 84] and [Huet 80a]. We 


begin with basic definitions of terms and functions on terms. 


Let V be a countable set of variables and F be a family of function symbols with associated arity 
such that V and F are disjoint. We recursively define the set of terms, T(F, V), as either a variable or a 
function symbol of arity n, followed by n terms. We assume the sets V and F to be fixed and, thus, use 
T in place of T(F, V) without ambiguity. Function symbols of arity zero, called constants, will be 
denoted by the letters a, b, c, d, and numerals 0, /, to be distinguished from variables, denoted by the 
letters u, v, w, x, yz. For readability, we will use the symbols, +, *, and * as binary infix operators. 


Examples of terms are f(x, a), h(x *0), andy + /. 


Given a term, t, let Y(t) be the set of variables in t and F(t) be the set of function symbols int. The 
root symbol of the graph representation of a term, t, will be denoted t.head-—t.head is a variable if t is 
a variable, and a function symbol if t is not a variable. Terms formed from function symbols alone, i.e., 


containing no variables, are called ground terms and are denoted by G. 


An occurrence in a term names a node within the tree structure of the term; occurrences are 
represented by strings of integers, including the empty string, e«. The set of occurrences of a term, 
denoted O(t), is defined as follows: 

1. If tis a variable or constant, then O(t) = {e}. 

2. lft=f(t,,...t,), then O(t) = {e}Uf{i.o] 1 sisn & o€O(t)} 

An occurrence can be used to index into a term as follows: 


1.t/e=t 
2. f (tyent,)Zi.0 = t/o 
An occurrence is said to be proper if it is not the empty occurrence and strict if the subterm at that 


occurrence is not a variable. 


A substitution is a mapping from variables to terms, extended to an endomorphism (a homomor- 
phism from a set to itself) on terms. l.e., if o is a substitution then Of (tys--st,) = S(ot,,.,0t,). A 
substitution will be denoted by a set of variable to term mappings, {¥ ght Vottorneds where all variables 
outside the set are implicitly mapped to themselves. The identity substitution, i.e., the substitution 
mapping every variable to itself, will be written 4. The universe of substitutions will be denoted S. We 
define the domain, 5, of a substitution, o, as follows: D(a) = {v | ov # v}. Note that this differs from 


the usual notion of a function's domain, since it contains only those variables that are not mapped to 


themselves. The range. 3. of a substitution is defined as Ria) = oe {av}, and the range 
y€2b(a) 
varables. J as ifa) = can 11‘). A substitution, o, can be restricted to a set of variables, V, written 
“CER a} 
of. by mapping all variables outside V to themselves, o|, = {+ ear | EV}. 
Thus. ifo = (xeu. ves{(z)}. then of(v) = f(a). ox(e,s) = g(z.f(2)) and oly pa ivea}. The 
domain. range. and range variables of o have the following values: Z(o) = {x,1}, B(o) = {a,f(z)}, 


and J(a) = {2}. 


A term, t. is said to be an instance of aterm, s. if and only if there exists a substitution, o, such that 
t=oas. When the domain of a is restricted to the variables in 1(s), o is unique and is called the match 
of t by s. Substitutions may be composed using functional composition, i.e., for any term, ¢, 
(a ,°o,)f = a ,(a,1). In the same sense that one term may be an instance of another, a substitution, 
1, is an instance of a substitution, Oy. if and only if there exists a third substitution, 7, such that o, = 


7°... 1n this case a, is said to be more general than o,. We denote this partial ordering by d, $9, 


1.3 Classical Unification 


This section presents the classical unification problem and provides an example of a simple 
algorithm for solving the problem. This classical algorithm will serve as a framework for our general- 


ized algorithm. 


Definition. Given two terms. t and s, a substitution, o, unifies t and s if and only if: 


In general, there is more than one substitution that will unify two given terms. For example, iff = 
Sfle(x),gQ))) ands = f(v.z) theno = {yee(x), z+2(g(x))} is a unifier, as wellaso = {3 «g(g(w)), 
ze g(g(z(w))), x¢g(w)}, and an infinite number of other substitutions. Observe in the preceding 
example that o’ is instance of 0: o = {x g(x)}ea. In fact, all unifiers of t and s can be written as 
some substitution composed with o, so we call o the most general unifier. The most general unifier of 
two terms is analogous to the least common multiple of two natural numbers; every multiple of two 
numbers is divisible by the least common multiple, just as every unifier of two terms is an instance of 


the most general unifier. 


Definition. The most general unifier of two terms t and s, is a unifier, 0, such that V p 
pt= ps => (J41, p=7°0). 


In classical unification, there is at most one most general unifier up to variable renaming. A simple 


recursive unification algorithm based on [Robinson 71] is given below. 
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unify = proc (t: term, s: term) returns (substitution) 
case 
is_variable(t) and is_variable(s) => 
return({t©s}) 
is_variable(t) => 
if t€ Vs) 
then failure: cycle 
else return({t ¢ s}) 
is_variable(s) => 
if s€ V(t) 
then failure: cycle 
else return({s ¢ f}) 
t.head#s.head => 
failure: clash 
t.head =s.head => 
Oi=t 
fori 1 to arity(t.head) do 
o:= unify(ot/i, os/i)°o 
end 
return(c) 
end 
end unify 


Figure 1-1: A Classical Unification Algorithm 


The algorithm points out two cases in which it is not possible to unify the input terms. If, at any 
point in the unification process, it is necessary to unify two terms in which the head operator symbols 
are not the same, a clash occurs and the terms are not unifiable. The second case is called a cycle 
and is succinctly shown by trying to unify the terms x and f(x). The unifier, {x «/(/(/(/(...))))} is 
infinite, although it can be finitely represented by a cyclic graph. In some applications, infinite unifiers 
are allowed or even desired [Filgueiras 82]; in many other applications infinite unifiers would lead to 
non-terminating program behavior. We will include the cyclicity test to disallow infinite unifiers, but 
note that a unification algorithm can be easily modified to allow infinite unifiers by removing the 


cyclicity test. 
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1.4 Equational Unification 


As suggested at the outset, we will use the word "unification" to stand for not the single problem 
of section 1.3, but for a class of problems which differ from classical unification according to the 
desired notion of equality. In particular, we will be using equality defined by an equational theory, 
although we could also imagine using non-equational or even higher order logics. The equational 
unification problems use a form of equality that is weaker than syntactic equality and are therefore 


relevant to applications in which a less rigid matching process is needed. 


1.4.1 Applications 


Until recently, most applications made use only of classical unification, however, the need for 
equational unification is clear. A number of operators that occur frequently in practice have 
properties described by equational theories, and equational unification provides at least one tech- 
nique for reasoning about operators with these properties. Furthermore, other common reasoning 
techniques, such as resolution and term rewriting, do not handle a number of useful equational 


properties that can be handled by incorporating equational unification. 


For example, the properties of associativity and commutativity, called the AC-theory, can be 
described by the equational axioms: 
1.x*(y*z) = (x*p)ez 
2.x*y =yex, 
The integer operations of plus and times are only two of the many examples of associative and 
commutative functions about which we would like to be able to reason automatically. Despite the 
prevalence of AC operators, basic term rewriting systems and resolution systems run into difficulty 


handling this theory. 


Another example of an interesting equationa! theory, less familiar to the mathématician, occurs in 
data type specifications for sets; the insert operation is "commutative" in the sense that the order of 
inserting elements into a set is not important. This property can be axiomatized by the equation: 
insert(insert(s, €,), e,) = insert(insert(s, eo), e4)s which is, again, a problematical axiom for term 


rewriting and resolution based systems. 


The difficulty with both the AC-theory and the insert operation comes from the symmetry of the 
axioms, which allows them to be used repetitively. A simple example using the commutative axioms in 


a Prolog program will exemplify the problem. Consider the Prolog program in Figure 1-2, which 
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contans the user-defined s‘/icg relation. The first three lines are input by the programmer: line one 
asserts that ce and mary are s:vings. line two asserts that the sibling relation is commutative, and 
line three is a query, asking for the sibings of mary. In response to the query, the Prolog interpreter 
returns the value ;ce for X, since joe was asserted to be a sibling of mary. When the interpreter is 
prompted for any other sibiirgs of mary. the interpreter again returns joe. The program will continue 
to loop in this manner, and in general there is no way to determine that all distinct answers have been 
found so that the process can stop. For similar reasons. the associative and commutative axioms ina 
term rewriting system cannot be oriented into rewrite rules without losing the termination property of 


the system. 


sibling(mary, joe). 
sibling(X. Y) :- sibling(Y, X). 
2. sibling(mary, X). 


X = joe;- 
X = joe: 
X = joe: 


Figure 1-2: A Prolog Program with Commutativity 


In both resolution and term rewriting, a solution to the problem is to build the symmetric axioms 
into the system, i.e.. into the unification process, so that the axioms are not explicitiy needed. [Plotkin 
72] describes the extension of resolution to resolution with equational unification and [Peterson 
81, Dershowitz 83b. Jouannaud 84] describe extensions of term rewriting systems to equational term 
rewriting systems. Resolution and term rewriting systems are two of the current uses of equational 
unification, but applications are by no means limited to these two. A review of some ideas for using 


equational unification is given in [Siekmann 84]. 
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1.4.2 Equational Theories 


The equational unification problem will be defined in Section 1.4.3. For that purpose, we will need 
an understanding of equational theories and some related definitions. This section gives the neces- 
sary formal background. There are two approaches to presenting equational theories: proof theoretic 
and semantic. The proof theoretic approach, based on syntactic inference rules, is presented first 
and the semantic, or algebraic approach, is presented second. The key result in the study in equa- 
tional logic is the work of Birkhoff, who proved that the two characterizations of equational theories 


are equivalent [Birkhoff 35, Gratzer 78]. 


An equation is a pair of terms, t = s. A congruence relation is an equivalence relation, ~, closed 


under the equality rule: 
f~s,1sisn= f(t,,..ut,) ~/(S,,...8,) for all f CF of arity n. 


Given a set of equations E, the equational theory presented by E is the set of equations E* formed by 
the finest congruence over T that contains E and is closed over instantiation. We will denote this 
congruence relation by t eS, meaning a € E*. E* is exactly the set of equations derivable from E by 


a finite proof, using the following inference rules given by Birkhoff: 


1. Reflexity: x =x is alway an axiom. 

2. Symmetry: Fromt ss deduce s sf 

3. Transitivity: From ei and r Py deduce t z 

4. Equality: From t, =S) 1sisn deduce f(t,,.-.t,) (8 4-8,), F oF arity n. 


5. instantiation: From tes deduce ct =08. 
We will consistently use E and E*, respectively, as a set of axioms and the equational theory 
presented by those axioms. Note that E* is uniquely defined from E, but there may be more than one 
presentation, E, given a theory E*. Even if E is an irredundant presentation it may not be unique; for 
example, group theory has a number of distinct irredundant presentations. This fact will, in some 
cases, force us to fix the presentations of theories so that proof theoretical arguments can be based 


on a well-defined set of axioms. 


The following discussion of algebras is needed for giving the semantics of equationa! theories. An 
algebra, A, is a pair (A, F), where A is a set of elements called the carrier of A and each f € Fisa 
function from A” to A for some arity, n. A mapping, », from V to A, (i.e., to the carrier of A) extended 
as a homomorphism from T to .A, is called an A-assignment. It is important in this discussion to 


distinguish between the semantic and syntactic objects. For example, if A is the set of integers and F 
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a set of integer operations, then addition is a function in F and the number "one" is an element of A, 


whereas "+" and "1" are syntactic objects for which we may choose any interpretation. 


One of the simplest models that exists for any equational theory is the term algebra, J. The term 
algebra has as its carrier the set of terms, T=T(F, V), and as its set of functions a set of term 
constructors, F, one for each function symbol in F. For example, if f is a unary operator in F, then 
there exists a corresponding unary function, f in F, such that f maps any term, f, in T to the term f(t). 


Because the carrier of J‘is exactly the set of terms, the identity map is one example of a J-assignment. 


If vt = vs for all A-assignments, v, then 4 is called a model! of the equation t = s,andt = sis said 
to be valid in A; we denote this condition by A Ft = s. Validity can be extended to a set of 
equations by: A — E if and only if ((=s) € E = A K(t=s). Given a set of equations, E, we denote 
the class of all models of E by Ab(E) and the set of equations valid in a class of models, M, by Eq M. 
Given a set of axioms, E, we can semantically define an equational theory as the set of equations valid 
in all models of E. To reiterate the equivalence between the algebraic and proof theoretic charac- 
terizations of equational theories, note that the soundness and completeness of the above inference 


rules, as proved by Birkhoff, can be written E* = Eq AL(E). 


If there exists a non-trivial model of E, i.e., A=(A, F), and AEE and A] > 1, then the theory 
presented by E is said to be strictly consistent. Syntactically, an equational theory has only the trivial 
model if and only if x a0 since any equation is a substitution instance of this one. The unification 
problem in an inconsistent theory is always trivial. By assumption, we will work with only strictly 


consistent equational theories. 


The equivalence relation on terms defined by an equational theory can be extended to an equiv- 


alence relation on substitutions: o . o, if and only ifVv EV (o,v : gv). in many cases we are 


1 
interested only in the effect of a substitution on a particular set of variables, V. We extend our defini- 


tion as follows: o, ~ v5 if and only if Vv EV o,v po Furthermore, we say that o, is more general 


TE 


than o,, modulo E over V, written o 


2 


Vv 
1 2% if and only if: 
E 


‘ V 
There exists + such that T°O, : Ty. 


: : ‘ Vv Vv . ' 
The equivalence relation defined by a, $0, and d, 39, will be denoted o,x0 or, if V=V, by o,=0,. 
E E E 2 VE 2 
The relation = corresponds to our intuitive notion of equivalent modulo E up to variable renaming, 


Vv 3 ; ; : 
and = corresponds to same relation where only the domain of V is considered. 


The faliowing properties hold for the equivalence relation on substitutions. They will be used 
freely in the proof of correctness in Chapter 3 and are presented here to avoid distraction during 


those proofs and to aid the reader in developing intuition about = on substitutions. 
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wv V f 
1.6 205% 0,90, =0. "C5, for any o, instantiation 


is 304 E 3 
Vv ee 
2. (0,°0,)°0,, = 9 ,°(0,°0,) associativity 
H ce 3 E 
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Having presented the basic concepts involved in working with equational theories, we are now 


ready to examine the unification problem in these theories. 


1.4.3 A Problem Statement 


Equational unification is the problem of solving an equation of the form t = s in the quotient 
algebra. J/ a whose Carrier is the set of congruence classes of terms defined by E. if E* is the empty 
theory. then we again have the classical unification problem. The problem is distinguished from the 
problem of satisfiabiiity in a first order theory. In unification the interpretation of symbols is fixed as 
the term aigebra interpretation. whereas determining satisfiability of a first order statement is the 


problem of finding whether there exists any interpretation in which the statement is valid. 


Definition. Let t and s be terms and E be a set of equations. A substitution. co. is said to be an 


E-unifier of tand s if and only if: 


1.4.3.1 Properties of Unification Algorithms 

Let U, denote the set of all E-unifiers of terms t and s, i.e., U.{t,s) = {a €S | ot rag s} and let 
V=%THUMNs). As in classical unification, Ure is infinite; we represent it by a complete set of unifiers 
from which set the U, can exactly be generated by considering all instances of each substitution in 
the set. If every element of a set of unifiers is necessary for completeness, it is called a minimal 


complete set of unifiers. The following set of definitions formalize these concepts. 


Definition. Let = be a set of unifiers of tand s and V = 7{t)U(s). 2 is said to be complete if and 
only if it generates all unifiers: 


Vo€U,(t, s) J atl a Xo 
E 


Definition. Let = be a set of unifiers of t and s and V = Y{t)UY%s). 2 is minimal if and only if no 


substitution in 2 is redundant: 
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When it exists. a minimal complete set of unifiers is unique up to = for any E* [Fages 84]. The size 
of the minimal complete set is bounded for certain values of E. IfE = @, there is always a singleton 
complete set for any two unifiable terms. {f E contains only the associative and commutative axioms 
(the AC theory) then the complete set is always finite. If E contains only the associative axiom, then 
there are some pairs of terms for which every complete set of unifiers is infinite. If there is a finite 
complete set then a minimal complete set always exists and can be found by filtering out non-minimal 
unifiers through matching. For some infinite cases, the properties of minimality and completeness 


may conflict, so that no minimal and complete set exists [Fages 83a]. 


For completeness, it may be necessary for an E-unification algorithm to use more variables in the 
range of the unifiers than occur in the terms being unified. Because unification procedures are often 
used within a larger system containing variables of its own, it is useful to require an additional 


property to protect the existing variables from being used as new variables. 


Definition. Let = be a set of unifiers of tand s, V = 1{t)UY(s), and let W be some set of variables to 


be protected, where VCW. ~ is protective if and only if: 
Vo€= DB(o)CV & W-VNI(c) = B 
& D(a)NI(a) = D. 


Without loss of generality, we will assume sets of unifiers are protective, both for the pragmatic 
reason given above and for the technical reason that it makes unifiers idempotent, (i.e., 


B(o)MI(a) = B => oa = a) which will be used in the proofs. 


The properties on sets of substitutions are extended to properties on a unification procedure; 


collectively, they constitute partial correctness of a procedure. 


Definition. A procedure, E-unify is a partially correct unification procedure for E* if and only if for all 
terms t and s and any finite set of variables WD1(t)UMs), if E-unify terminates with a set of substitu- 
tions 2, then: 


1. consistency: c€X => ot a8: 
2. completeness: 2 is complete for t and s. 


3. protection: 2 is protective of t, s, and W. 


If, in addition to being partially correct, E-unify returns only minimal sets of unifiers, then it is said 


to be a minimal procedure. A procedure which is partially correct and terminating is called totally 
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correct and is referred to as an algorithm rather than a procedure. Any theory with a terminating 
unification algorithm has a minimal complete set of unifiers for any pair of terms. However, minimality 
of the algorithms is not included in the correctness criteria because it is a difficult to guarantee 
without the costly filtering process, and because it is not necessary in most applications of E- 


unification. 


1.4.3.2 Classifying Solutions to the Unification Problem 

Historically, there are a number of ways in which one can solve the unification problem for a 
particular theory, E*. We will classify each kind of solution because it will help to clarify the contribu- 
tions of our approach and the assumptions behind it. The terminology used here is not well-defined 
in the literature, but we establish a convention based on common usage for referring to each kind of 
solution after giving its characterization. 

1.For a given equational theory, E*, the simplest unification problem is called the 

variable-only case. The assumption is that there is one set of axioms, E, and terms to be 


unified contain only function symbols appearing in E and variables. 


2. Unification in the case with free symbols is, again, unification with a single set of axioms, 
E, but the terms may contain free function symbols, i.e., unconstrained symbols of any 


arity, in addition to function symbols in E and variables. 


3. The multiple instance case unification problem allows more than one instance of a set of 
axioms, for example, the AC theory for + and the AC theory for *. In this case terms still 
contain only function symbols from the axioms and variables, but a single term may 


contain more than one operator with the given properties, i.e., both + and *. 


4. The unification problem for combined theories is to take sets of unrelated axioms, for 
example, the AC theory for + and the "commutativity of insert" theory for sets, and allow 


unification of terms containing function symbols from both of these theories. 


It is this last problem, the problem of combining equational unification algorithms, which is studied 
and partially solved here. This thesis provides an algorithm for combining equational unification 
algorithms for a restricted class of equational theories and characterizes some of the theories for 
which the combined problem is not solved. This problem was suggested as an open problem in 
[Siekmann 84] and [Shostak 84]. | 
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1.5 Related Work 


This section discusses some of the work that has be done on developing unification algorithms. 
Section 1.5.1 give a short survey of algorithms that have been designed to solve the unification 
problem in one particular theory. Section 1.5.2 describes a class of unification procedures that can 
be automatically generated from an axiomatization of the theory, and Section 1.5.3 looks at some 


work related to the general problem of designing algorithms for combinations of theories. 


1.5.1 Single Theory Algorithms 


A number of unification algorithms have been developed for particular equational theories, and a 
great deal of effort has been devoted to improving and bounding the running time of these algorithms. 
It is interesting to note that theoretical measures of complexity for these algorithms often do not 
reflect their relative running times in practice. This is probably because of the small size of terms in 
the average case. although little work has been done in trying to formally characterize the average 
case for a unification problem or in measuring the performance of algorithms based on an average 


case. 


For the empty theory, the first algorithm was described in [Robinson 71] and is exponential in the 
size of the input. It has been modified by representing terms as directed acyclic graphs rather than 
trees [Corbin 83] to give an n? algorithm. The algorithm of [Paterson 78] runs in linear time and those 
of [Martelli 82] and [Baxter 73] run in nearly linear time. [Martelli 82], while theoretically slower than 
the linear algorithms, runs faster on some typical examples. Also, the modified algorithm of [Corbin 
83] is fast in practice and has the additional advantage that the structure of the algorithm is simple 
and intuitive; one disadvantage of the [Corbin 83] approach is that it depends heavily on a data 


structure for terms that may or may not be appropriate within an application. 


Some of the currently known complete E-unification algorithms are for commutative operators 
[Siekmann 79}, AC operators [Stickel 81, Livesey 76] (with termination in the multiple instance case 
proved in [Fages 84]), signed trees [Kirchner 81], one-sided distributivity [Arnborg 85}, and transitivity 
[Kirchner 85]. There are variations on the AC algorithm [Livesey 76, Fages 84] for AC with idem- 
potence, and AC with a unit element, AC with both idempotence and a unit. An algorithm for the 
variable-only case of free abelian groups is given in [Lankford 84], and more generally for finitely 
presented abelian groups in [Kandri-Rody 85]. [Kandri-Rody 85] also gives unification algorithms for 
finitely presented boolean rings and finitely presented boolean rings of polynomials (i.e., with 


idempotence). 
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The decision problem for unification in the associative theory, also known as string unification, 
has been shown decidable [Makanin 77]. The associative theory has, in general, an infinite set of 
most general unifiers, so a terminating algorithm cannot exist. However, a complete procedure for 
enumerating unifiers of an associative operator is described in [Plotkin 72]. This procedure is given 
for one associative operator with unconstrained symbols of any arity. Not all equational theories have 
decidable unification problems. [Szabo 78] shows that unification in the associative distributive theory 
is undecidable and [Arnborg 85] shows that combining associativity with one-sided distributivity and a 


unit element gives a theory with an undecidable unification problem. 


For theories in which a unification algorithm is known, the execution times of many have been 
disappointingly high. These observations are explained by some recent results classifying the com- 
plexity of different unification problems. The unification problem in the commutative theory is known 
to be NP-complete [Garey 79], and in the AC theory to be NP-hard [Kapur 85, Chandra 84]. Unifica- 
tion in the theory of right and left identity is NP-hard while the theory of one-sided distributivity can be 
done in polynomial time [Arnborg 85]. A restricted case of unification is the matching problem, in 
which a substitution is applied to one term to make it equal to another. [Benanav 85] shows that even 


this simpler problem has an NP-complete decision problem in both the AC and commutative theories. 


Much of this past work in equational unification has made use of simplifying assumptions on the 
structure of terms, i.e., algorithms are usually developed to handle terms whose operators all belong 
to a single set of axioms. In most cases, the above unification algorithms were designed for the 
variable-only case, possibly with constants. Contrary to many claims, we show in Section 2.4 that 


extensions to the case with free symbols is often non-trivial. 


Under the current approach, every time a new axiom is added to the theory, a new unification 
algorithm must be found and implemented for the new set of axioms. The work of [Fages 83b, Fages 
84] takes steps toward remedying this situation by extending the unification algorithm for AC to 
handle terms containing a mix of theories including empty, commutative, and AC theories. However, 
his approach is still ad hoc rather than generalized. Adding another theory to his algorithm, for 
example the theory of left distributivity, would require modification of his algorithm. The modified 
algorithm would have to consider terms containing all the possible combinations of operators. In 


contrast, this thesis describes a method for automatically combining theories. 


Figure 1-4 summarizes some previous results on developing equational unification algorithms, 
with careful attention paid to whether an algorithm is a solution to the variable-only case, the case 


with free symbols, or the multiple instance case. Algorithms that permit constants as well as variables 
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are recorded in the variable-only case column; the constants are added in some theories to keep the 
_ decision problem from becoming trivial; they play an uninteresting role in the prablem of generating a 
complete set of unifiers, except to eliminete those unifiers that equate two constants. Algorithme from 
 {Hullot 80] and [Jouannaud 83] are based on narrowing, @ process fer-unification described in the 


following section. 
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Figure 1-3: Some Common Equational Theories 
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Figure 1-4: Known Unification Algorithms 


23 


Multiple Instance 


open 


[Fages 83b] 
[Fages 83b] 
[Hullot 80] 


[Jouannaud 83] 


open 
[Jouannaud 83] 
[Jouannaud 83] 
open 

open 

[Hullot 80] 


this work 
[Kirchner 85] 


open 


this work 
[Kirchner 85] 


this work 
[Kirchner 85] 


this work 
[Kirchner 85] 


this work 
[Kirchner 85] 


[Hullot 80] 
open 
open 
open 


1.5.2 Narrowing 


While most of the existing unification work in unification has required human invention of each 
algorithm, the unification procedures based on narrowing [Slagle 74] are automatically generated. 
For equational theories representable by a convergent term rewriting system there is method for 
performing unification in the theory of the rewriting system [Fay 79]. [Hullot 80] gives sufficient con- 
ditions for termination of the narrowing process, along with some improvements, and [Jouannaud 83] 
generalizes this work to equational term rewriting systems. [Rety 85] further improves on the ef- 


ficiency of the narrowing process and detects cycles in the unifiers. 


The approach described in this thesis serves a quite different purpose than the work on unifica- 
tion through narrowing; the narrowing procedure does not assume the existence of equational 
unification algorithms, but generates a procedure based on its axioms, whereas our approach might 
use a unification algorithm produced by narrowing as one of the basic pieces to be combined. For 
example, the work in narrowing has lead to unification algorithms for theories described by a conver- 
gent term rewriting system in which all right-hand sides are either a single variable or ground term. 
These theories include idempotence, identity, and quasi-groups. The algorithm described in this 
thesis applies to narrowing algorithms for theories in which right and left sides of all axioms are 
ground terms, e.g., 1*1=0. In other words, given a theory presented by ground equations, we can 
automatically generate a unification algorithm through narrowing; this algorithm can be combined 


using our approach with other unification algorithms. 


The narrowing process is interesting from a theoretical standpoint, and gives quick positive 
answers to the question of existence of an E-unification algorithm for theories presented by ground 
equations or unconfined equations. For the cases in which narrowing gives a unification algorithm, it 
will also solve the problem with free symbols or multiple instances, but can solve the combining 
problem for only very limited combinations of theories. Moreover, even when the process terminates, 
it is too inefficient to be practical as part of a larger system such as a term rewriting completion 


procedure or resolution system. 


1.5.3 Combining Theories 

Although the problem of combining unification algorithms is a known open problem [Shostak 
84, Siekmann 84], the problem of combining decision procedures for first order theories has been 
studied. Nelson and Oppen provide a procedure for deciding whether a formula is a theorem in 
combination of first order predicate calculus theories [Nelson 79]. Their algorithm uses a set of 


decision procedures for the theories being combined, much in the same way we will use unification 
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algorithms for tne theories being combined. [Shostak 84] improves on the algorithm of [Nelson 79] by 
localizing the information shared between algorithms. This yields an improvement in the algorithm's 


efficiency and extendibility. 


The similarity between the structure of our unification algorithm and the decision procedure of 
{Nelson 79] is apparent when unification is considered in the Martelli and Montanari style of 
propagating equalities [Martelli 82]. [Kirchner 84a] gives an aigorithm based on this style for unifica- 
tion in the decomposable theories. a class of theories in which a natural decomposition process 
occurs during unification. For example. if f is a symbol that does not appear at the head of either 
side of any equations in E, then the problem of E-unifying terms of the form S(S4.+8,) and L(tyreot,)s 


modulo E, is proved equivalent to unifying all pairs S,1,18 isn. 


[Kirchner 85] has independently developed an algorithm for combining unification algorithms. He 
generalizes his earlier work by defining the notion of a decomposition process for theories in which 
the natural decomposition does not occur. For example, in the the commutative and AC theories 


there is a finite set of possible decompositions related to the set of possible unifiers. 


The algorithm of [Kirchner 85] is based on two main phases: simplification of the unificands, and 
formation of substitutions. His algorithm is correct for a slightly smaller class of theories than the 
algorithm we will be defining. !n particular. if a theory contains a ground equation in an irredundant 
presentation, then Kirchners approach does not work. We will discuss the exact details of his 
restrictions on equational theories in Section 2.1.3.3 after our own restrictions are presented. 
[Kirchner 85] reports that his implementation works faster than ours on some typical examples using 
the AC theory. This may be because of earlier discovery of clashes using his approach, since he 
detects all clashes during the simplification phase before beginning the more expensive phase which 


involves cyclicity testing. 
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Chapter Two 


A Generalized Unification Algorithm 


This chapter presents our approach to generalized unification. The presentation is bottom-up. In 
Section 2.1 we state some assumptions on the equational theories, Section 2.2 presents a description 


of the algorithm, and Section 2.3 gives a detailed example. 


2.1 A Generalized Approach 


In equational unification, a unification algorithm must be discovered and implemented for each 
equational theory of interest, and with some notable exceptions, (see Section 1.5) this process is not 
automatic. As we will see, the problem of combining algorithms is also non-trivial. Our approach is to 
break a combined unification problem into pieces that we know how to solve with the sub-theory 
algorithms, and then to combine the answers for each these sub-problems to get a solution to the 


whole problem. 


Our algorithm is recursive; a top-level procedure performs the steps in unification that are com- 
mon to all equational theories and then invokes an appropriate equational unification algorithms for 


sub-problems particular to one theory. 


2.1.1 Partitioning Equational Theories 


Our first underlying assumption is that the sets of operators handled by each unification proce- 
dure are disjoint. Consider the following example: we are given a unification algorithm for AC- 
unification and an algorithm for unification with an idempotent constant (IK-unification). If the + 
operator is known to be AC and have an idempotent constant, a, (ie.,@+a@ = a), our technique will 
not automatically generate an algorithm for AC-IK-unification because the AC and IK axioms interact 
through the shared symbol +. On the other hand, if + were AC and * were IK with a, and there were 


no other axioms in E, our approach would generate an algorithm for this theory. 


We will treat each unification algorithm for an equational theory as a “black-box,” invoking it with 
certain inputs, but never examining the operation within the box. The above problem with + being 
both AC and IK can be eliminated by considering only sets of axioms with disjoint operator sets. 
Formally, we define a partitioning on the axioms presenting E*. 
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Definition Leta = {E, E aicee E where each Eis a set of equations. 7 iS a partitioned 
preseniaton of an equational theory E* if and only if: 
1. SEINE) = ©, Wit) sn 


2.y E is a presentation of E*, and 


3. WEn. 


Each of the Es presents a theory, E*., called a sub-theory of E*. The empty set of equations in (3) 
represents the empty equational theory, which is a sub-theory of any theory. Semantically, the union 
of a set of equational theories corresponds to taking the intersection of their models, AL(E) = 


AE}. 


The partition, 7, naturally defines an equivalence relation on function symbols. Let 1g, =f, if and 
T 
only if either: 


1. There exists E. € m such that /, € S(E) and f, € F(E)). 


2. Or, forallE € 7. f, €S(E) and sf, ¢ S(E,). 
The equivalence class of symbols containing f will be denoted [/], e.g., if one of the sub-theories is 
ACZ with + as the AC operator and 0 as the identity constant, then[ +] = {+, 0}. Function symbols 
that do not appear in any of the E's are called uninterpreted and all belong to one equivalence class. 


This equivalence relation will provide a convenient way of naming unification algorithms of sub- 
theories. It is not quite correct to refer to an operator as being ’in' a particular sub-theory. since each 
sub-theory has the same fixed signature, F. We therefore fix the partitioned presentation a for E* and 
refer to the set of function symbols appearing in E, as the constrained function symbois for E*. For 
technical reasons, we will let the set of uninterpreted symbols be the constrained symbols for the 
empty theory. The unification algorithm for the theory constraining F will be denoted E,-unify or 


Equnity, if fF. 


An example should help clarify our definitions. Let E* be presented by the axioms in Figure 
2-1 and let the signature, F, be the set {+, +, a, 5, *, f, g} with appropriate arities. 7 = {E,,E,, Es 
Es iS a partitioned presentation of E. 7 is said to be a minimal partition because none of the elements 
of 7 can be divided further without losing the disjointness property on function symbols. In general 
we will use minimal partitions because this results in the simplest sub-theories, although minimality of 


partitions is not required. 


There is a final technicality to clarify before beginning the discussion of our algorithm. Although 
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7 = {E,, E., E. Ey Classes of F: {F,, F.. Fas FA} 


Esx+yoyex F, = F(E,) = {+} 
(x +y)+2=x + (yp + x) 

Ex eye yee F, = S(E,) = {*, a} 
a*az=a 
(xt y)ez =x +2) 

Es xey=yex F, = $(E,) = {*} 
(xey)ez=xely*x) 

Ee Fi {b, f, g} 


Figure 2-1: A Partitioned Presentation 


we speak of an E-unification algorithm for a particular equational theory, each algorithm is really for 
an isomorphism class of equational theories. For example, if both + and ° are AC, as in Figure 2-1, 
we can use the same algorithm for unifying a pair of terms containing + or a pair of terms containing 
*, The two equational theories, + AC and * AC, are not equal theories, but the isomorphism is so 
natural that we would normally consider them to be the same. A difficulty arises when the two 
theories are combined, i.e., when terms to be unified contain more than one operator with the same 
equational properties. To resolve this issue, each E-unification algorithm is parameterized over the 
set of names of its constrained operators. In this example, AC-unification for + and AC-unification 
for * are both instances of the same E-unification algorithm. For the purpose of this discussion, we 
will assume a different unification procedure exists for each instance of a theory, although in the 


implementation we do not duplicate the actual code. 


2.1.2 Some Basic Functions 


Our algorithm begins by transforming the input terms into simpler terms containing only operators 
from a subset of the axioms, a subset for which there is a known E-unification algorithm. The 
information lost in the transformation is saved in the form of a substitution. This substitution is 
combined, through E-unification of substitutions, with each sub-theory unifier of the transformed 
terms. Section 2.1.2.1 describes this transformation process on terms and section 2.1.2.2 describes a 


procedure for unifying substitutions. 


28 


2.1.2.1 Homogeneous Terms 

A term, t, is called homogeneous with respect to a set of function symbols, F, if and only if 
F(t) C F. We define a homogenizing function, Homog, to convert an inhomogeneous term (i.e., a term 
containing operators that are not in F) into a homogeneous term. The basic operation of Homog is to 


replace all maximal subterms whose top function symbol is outside F with a new variable. 


Definition. Let F be a set of function symbols and f be a term, then Homog(t, F) is defined as: 


1. If tis a variable, then Homog(t, F) = t. 
2. |ft = SF (tyrveat) and f € F, then Homog({t, F) = f (Homog(t,, F),...,Homog(t,, F)). 


3.Ift = f(t,,...t,) and f € F, then Homog(t, F) = v, where v is anew variable. 


As defined, Homog(t, F) is not a function but is unique for t up to names of the new variables. 
Technically, we should be more precise about the new variables that are used, for example, in case 
(2) we assume any new variables in Homog(t) are disjoint from both the old variables in t and new 
ones in Homog(t), for i#j. To assure our algorithm is protective, these new variables must also be 


disjoint from the set of protected variables. We formalize the naming of new variables in Appendix A. 


Taking F = {a , *}, we have the following values for Homog: 


Homog(x *(a+y), F) = x*V, 
Homog(x *{a*b), F) = x*(a *v,) 
Homog(x +y ,F) = Vg: 


In general, homogenization of a term is done with respect to some equivalence class of F as defined 
by a, usually the equivalence class of the root symbol. We use the notation f to denote 


Homog(t,[t.head)]). 


In forming a homogeneous term, part of the structure of the original term is lost. To take a 
homogeneous term back to the term from which it was formed, we find a preserving substitution. 
Notice that t is an instance of f and we can therefore find the match of t for f, in this case called a 
preserving substitution, by Preserve(t, ft). Preserve(t, f) maps each new variable in f to the term it 
replaced in t. We distinguish the preserving substitutions from normal matching because the prevers- 


ing substitution is unique for t within variable names in its domain. 


Definition. Let t be a term and fbe its homogenous form. The preserving substitution for t and fisa 


substitution y such that: 
BlyCN) & yf=t. 


2.1.2.2 Unification of Substitutions 
This section describes our method for combining sub-problem unifiers, which involves unification 


of substitutions. 


Definition. Given a set of equations. E, a substitution, o, is said to E-unify two substitutions, P and 
q'p If and only if: 
o° 9, : a° Fo: 


We need an effective procedure, call it map-unify, for finding unifiers of two substitutions, , and 
Py. In looking for a unifier of two substitutions, as in testing for equality of substitutions, we restrict 
ourselves to the domain of the variables V = Tp JUT(p,). A corresponding pair of terms is defined 
to be a pair, <t,, t,>, where t, = ¢,v and t= Goh for some rv € V. If we can unify each corresponding 
pair of terms in substitutions sequentially, accumulating the unifiers. and applying the results to 
remaining pairs, the end result will be a set of unifiers of the substitutions. The routine in Figure 
2-2 performs the desired function. map-unify assumes the existence of our main algorithm, CR-unify, 
because the two procedures, map-unify and CR-unify. are mutually recursive. (For technical reasons 


that will be made clear in the termination proof, the variables in Tq.) are unified after the variables in 


T(p,).) 


map-unify = proc (P,> ¢ 7Subst) returns(2:subst_set) 
V,:= B(p,)-S(¢,) 

2:= Fp.) 

{1} 


u 


0 : 


forj = 1to2do 
foryin V, do 
i:=i+1 
Z i= {w°0, ,] 0, ,€2, , & w€CR-unify(o, .p,¥, 5; +Po)} 
end 
retu rn(2) 
end map-unity 


Figure 2-2: Procedure map-unify for Unification of Substitutions 


2.1.3 Restrictions 


The CR-unify procedure presented in this thesis is correct for only a restricted class of equational 
theories. Two syntactic restrictions on the axioms in 7, confinement and regularity, are sufficient to 


show correctness. 


2.1.3.1 Confined Theories 

The first restriction will be to eliminate those equational theories in which two E-equal terms have 
head symbols that are constrained by different sub-theories. Because we assume the axiom sets are 
disjoint, this case can occur only if there is a equation in the theory of the formt = s, where either t or 
sis a variable and the other term is a non-variable. If this kind of equation is in one sub-theory, then 
there will also be instances of this equation in the entire theory, where the heads are not equivalent 
modulo 7. Equations of this form, between a variable and a non-variable, will be referred to as 
non-conftining, because they provide a means of deducing equations whose roots are not confined to 
the same equivalence class of F. A set of equations containing no non-confining equations is called 


confined; this terminology also applies to theories since they are closed sets of equations. 


An example of an unconfined theory is the theory of idempotence. Let E be the theory presented 
by w = {{x *x = x}, S}, and let f be a function symbol in F. The equation f(x) * f(x) efx) isin E, 
even though ° and f are in different equivalence classes. The problem caused by having roots in 
different equivalence classes will be apparent in the description of CR-unify, where we begin the 
unification process by unifying in the sub-theory of the roots. Lemma 1 will state that this restriction 
can be made on the axioms, rather than the theory, and Theorem | will give the desired property on 


head symbols of equations. We postpone the presentation of these proofs until Chapter 3. 


2.1.3.2 Regular Theories 

The second restriction is on the sets of variables in equations of the theory. The problem comes 
from variables that occur on one side of the equation, but not on the other. An equation, t=s, is 
regular if and only if Y(t) = Y(s). We extend this to sets of equations by: E is regular if and only if all 
equations in E are regular. As in the case of confined theories, we can restrict ourselves to reqular 
theories by restricting the set of axioms. Lemma 3 states the equivalence of these two properties and 
Theorem II gives sufficient conditions under which a variable and a term containing that variable are 


not unifiable. 


We have eliminated some interesting theories, such as idempotence, identity and minus, however, 


there are still many interesting theories that are both confined and regular. The distributive theory, 
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the homomorphism theory, and any theories presented by ground equations are confined and 
regular. In addition, any theory in which the right side of each equation is identical to the left side 


within a permutation of variables, i.e., a permutative theory, is confined and regular. 


Recall from Section 1.4.1 that one of our reasons for studying E-unification was to avoid termina- 
tion problems in systems that use equations as oriented rules. A permutative equations oriented in 
such a manner will always a program to loop, since they can be applied repeatedly. An unconfined 
equation, if oriented into a rule, will not lead to termination problems since the variable side of the 
equation is simpler than the non-variable side. Consequently, the equations that most often lead to 
termination problems in an application can be handled by our unification algorithm, whereas equa- 
tions that cannot be handled by our unification algorithm will often not cause termination problems in 


an application. 


2.1.3.3 Strict and Strongly Complete Theories 

In this section we digress in order to give a careful distinction between our restriction to confined 
regular theories and the restrictions of [Kirchner 85]. As discussed in Section 1.5.3, Kirchner’s algo- 
rithm is based on two main phases: simplification of unificands and formation of substitutions. His 
algorithm for simplification is correct and terminating for the confined theories (he calls them 
non-potent theories.) The simplification algorithm produces a simplified system of unificands and is 


complete in a sense defined in that work. 


The second phase, formation of substitutions, involves forming substitutions from simple pairs of 
unificands, e.g., one variable and one non-variable. In general, discovery of all unifiers may require 
another phase of simplification after some substitutions have been formed, and this makes it difficult 
to guarantee termination. To avoid having to alternate between simplification and substitution for- 
mation, Kirchner places further restrictions on the equational theories so that a single phase of 


simplification followed by a single phase of substitution formation gives a complete set of unifiers. 
The correctness of his unification algorithm depends on the theories being strongly complete and 
strict as well as confined. 


Definition. A theory E* is said to be strongly complete if and only if for all variables, x, and terms, t, 


there exists 2, a complete set of unifiers of x and t such that: 
Vo€X, Ko) = {x}. 


As an example of a theory that is not strongly complete, consider the theory presented by a + b =a. 


This theory is not strongly complete because the terms x +) and x are unifiable by the substitution 
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{xeu. ve"). but not by any substitution having only x in its domain. All theories presented by ground 
equaticns that have at least one operator of arity two or higher in the presentation, will fail the strong 


completeness test. 


The notion of strictness of an equational theory depends on the following ordering on pairs of 


terms. 


Definition. <t,s>< <1’, s> if and only if: 


1. tis variable, t' is not a variable & t€ Vt’), or 
2. t is variable, s’ is not a variable & t€ Ws’), or 


3. 1 or 2 is true with s fort. 


Consider a set of unificands. i.e., pairs of terms to be unified, for which we want to find a single 
substitution that unifies all pairs in the set. If such a unifier exists, then we take the transitive closure 
of ~ 4: on the set and determine whether or not the transitive closure is strict. (A strict partial order is 
one which is irreflexive and asymmetric.) A theory is strict only if the transitive closure of <9) 


denoted <4-*. is strict. More precisely: 


Definition. A theory E* is strict if and only if for all sets of pairs of terms, P: 
do such that (V<t, s>EP, of =as) 


= <4" is strict on P. 
The simplest examples of theories that are not strict are those in which a variable is unifiable with a 
term containing that variable. For example, in the theory presented by ¢(g(x)) = ¢(x), eG) and y are 
unifiable, but <g(x), x> is less than itself by <q, So <gp is not strict on P = {<g(y’), y>}. Any theory 


with an equation in which one side is a subterm of the other will be non-strict. 


Kirchner’s restrictions on the equational theories are less general than ours because both al- 
gorithms require confinement and because strictness implies regularity. For evidence of the latter, we 
construct a pair of unifiable terms that cause a trivial cycle in the <4" ordering in any non-regular 
theory. Given any non-regular equation in E*, it will be of the form tas, where dv, v€t & v€s. The 
pair of terms vy and t¢ are unifiable in E* by the substitution {v+s}, but <q," is not strict on {<v, ft} 


since v Et. 


Note that the restrictions of strong completeness and strictness are conditions on the infinite set 
of equations in the theory, whereas our restrictions are syntactic checks on any presentation of the 
theory. Moreover, strictness implies regularity but not the reverse, so his algorithm is slightly less 
general than ours. Examples of theories handled by our algorithm but not by Kirchner’s include most 
theories presented by ground equations, e.g., a + b =a, and the confined regular theories containing 
equations with one side a subterm of the other, e.g., 2¢(g(x)) = g(x). 
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2.2 The Algorithm 


The main algorithm. an aigorithm for generalized equational unification of terms. is presented 
below. The basic assumptions are summarized here. E* is a strictly consistent equational theory, 
with a fixed partitioned presentation. MLE scars Ey For each Br there is a known unification 
algorithm. called E -unify, which returns a complete set of unifiers. given any two terms that are 
homogeneous in the constrained symbols of E*. We also assume that E* is confined and regular, 
although the reason for these last two requirements will not become apparent until the proof of 


correctness in Chapter 3. 


The CR-unity procedure is given in Figure 2-3. if t and s are variables or if tf is a variable not 
occurring in s. then the terms are unifiable by the substitution {tes}. If f and s are both non-variables 
with root symbols from different equivalence classes of F. then any substitution instance of t and s will 


also have root symbols with this property. so t and s are not unifiable (see Theorem |). 


if both t and s are non-variables with root symbols in the same equivalence class, then we form 
homogeneous terms and determine the preserving substitutions, Preserve(t, f) and Preserve(s, 8). 
The union of these substitutions is well-defined because the domain of each contains only new 
variables from f and S, and these two variable sets are disjoint by construction. We will refer to their 
union, y, aS the combined preserving substitution. P is found by unifying homogeneous terms in the 
appropriate sub-theory, and the preserving substitution is combined with each p € P by unification of 
substitutions. The set of unifiers, 2, is returned after restricting the domain to the variables in t and s. 
(Note: for readability, we have extended the notation for restricting substitutions to denote restriction 


of a set of substitutions, i.e., 2], = {o], | ¢€2}.) 


CR-unify = proc (t, s: term) returns (subst_set) 
case 


is_variable(t) and is_variable(s) > 


return({{tes}}) % case 1 
is_variable(t) and ~is_variable(s) => 
return(CR-variable-unify(t, s)) % case 2 
is_variable(s) and ~is_variable(t) => 
return(CR-variable-unify(s, t)) % case 3 
t.head x s.head => 
return (@) % case 4 
t.head =| s.head => % case 5 
y := Preserve(t, f) U Preserve(s, $) 
P:= Eneaay UY (f, S) 
z= rap ene: y) 
retu (Zanes) 
end 
end CR-unify 


CR-variable-unify = proc (v: variable, s: term) returns (subst_set) 
y := Preserve(s, 8) 
case 
v€ Us) => % case A 
return ({{v © s}}) 
v EVs) &v € Hy) => % case B 


Pi= Es head] 
zZ:= Umap-unify(p, y) 
€P 


-unity (v, 8) 


retu (Ze ure) 
vE Ns) & v € Hy) => % case C 
return(@) 
end 
end CR-variable-unify 


Figure 2-3: The CR-unify Procedure for Equational Unification 


2.3 AnExample 


This example shows unification in the equational theory, E, as presented in Figure 2-1. Let the 
input terms be, t= +(x *y) and s=a+z. Both are non-variable terms and the sub-theory of the root 
operator (+ in both cases), is presented by E.. The relevant axioms for this example are E,. the AC 
theory for +, E, the AC theory for « with idempotent constant a, and Ey, the empty theory with 


uninterpreted symbols 5, f, and g. 


Calling CR-unify(t, s), we find that case 5 of CR-unify is appropriate for two terms with roots 
constrained by the same sub-theory. Following this branch, we compute the homogeneous terms, f 


and §. The set of constrained symbols for E, is { + }. 


fav,+V, S=va+Z, 


The preserving substitutions are: 

Preserve(t, f) = {v,b, Votx +y} Preserve(s, 8) = {v,+a} 
and the combined preserving substitution y = Preserve(t, f)°Preserve(s, $) is: 

{v,+5, Vorx *y, vata} 


The homogeneous terms are unified in the sub-theory E*, the AC theory for +. AC-unifying f and 


$ results in a complete set of AC-unifiers. This set will contain two unifiers that are within = of: 
p, = {vt zy} 
Po = {¥gtV 5, Z¢V,}- 


We proceed by calling map-unify with P, and y. (Both p, and Po will be considered eventually and 


the choice of which unifier to look at first is arbitrary.) 
map-unity(p,, y) = map-unity({v .*v ,, zevo}{v eb, Vex ty, vary} 


Recall that a substitution maps each variable outside its domain to itself. The corresponding pairs 

of terms are thus: 

from p,: PZ =Vo P4%4=7, PyVn=Vo PyVa=Vy 

from y: yz =2 yv,=5 YW=xry Y,=4 
In this example, each recursive call to CR-unify will yield a singleton set of unifiers, although this will 
not be true in general. (If, at any point, more than one unifier was returned, we would proceed with 
each in depth-first fashion.) We will show the inputs to each call to CR-unify, the resulting unifier set, 
and the effect of applying the unifier on the pairs of terms. The pair of terms for z is unified first, 


because it is the only element of V,: the order of the rest of the cails is arbitrary. 


1. CRunt (203) returas {{- "1 3}} 


from p,: Vo vy Vy vy 
from y: v5 b xy a 

2. CR-unify(h. v ,) returns {{1 ,«5 }} 
from p,: V5 b b b 
from y: Pe b x+y a 

3. CR-unify(b, x *)) fails, since / and * are not in the same equivalence class. 

This ends the call to map-unify for P;: 
We call map-unify again. this time with p, and y. 
map-un'ty(p,, y) = map-unify({V ge ¥ », FEV FAVOR; om Seal Z vas'9 
The corresponding pairs of terms are: 
from p,: Pz =%, Pov,=¥, Po"n=Vo PoV3=Vo 
from y: yraz yv,26 Yy=x ry Y¥g24 
The following sequence of calls to CR-unify results. 

1. CR-unify(z, v ,) returns {{z¢v a 
from p,: vy uP Vo Vo 
from y: Vs, b x+y a 

2. CR-unify(b, v ,) returns {{v 75 }} 
from p,: b b Yo Vo 
from y: b b x*y a 

3. CR-unify(x *y, v,) returns {{v +x +y }} 
from p,: b b x*y xery 
from y: b b x*y a 


4. CR-unify(x *y, a) calls E,-unify since both terms are homogeneous in {*, a}. This results in 


the singleton set of unifiers {{x «a, y«a}} 


from p,: b b a*a a*a 
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from y: b b a*a a 
Composing the unifiers from steps 1 through 4, we get the substitution: 
{xa Jeazeb,v eb voea +a} 
or restricting to the variables in t and s: 
{xea,yea,z¢b}. 


We check that this is indeed a unifier of t and s by applying and testing for E-equality. 
b + (a *a) : a+b 


2.4 Difficulties in Exending CR-unify 


The restrictions of confinement and regularity were carefully chosen. Two arguments will make 
this point clear. Most importantly, we will prove in Chapter 3 that the restrictions are sufficient, i.e., 
that CR-unify is consistent, complete, and terminating for the confined regular theories. This section 
is devoted to showing the necessity of our restrictions. We will show that CR-unify is not complete for 
theories that are either unconfined or non-regular. A great deal of effort was put into the design of the 
CR-unify algorithm, and a number modifications attempted for the purpose of weakening the restric- 
tions of confinement and regularity. Although none of these modifications solve the more general 
problems, the incorrectness was not always obvious. We include some examples in this section that 


point to problems in the modified algorithms. 


The modified algorithms will be within the framework of our approach, i.e., E* will have a par- 
titioned presentation defining the sub-theories and these sub-theories will be assumed to have com- 
plete and terminating unification algorithms. Since the problem with CR-unify is completeness, the 


modifications will involve changing a failure case to a non-failure case. 


The first example shows that CR-unify is not complete for unconfined theories because case 4 of 
the procedure signals failure when the head symbols of the two terms are constrained by different 


sub-theories. 


Example 1. 
Ej: f(x) = x t = f(x) s = gly) 
E,: g(x) =x 


Case 4 of CR-unify applies because f+ g, so CR-unify returns the empty set. However, the substitu- 
T 


tion {x «y} is a unifier of fand s. 


Note that if we replace case 4 of CR-unify with two recursive steps, one for each sub-theory, the 


problem in Example 1 will cause the procedure to loop. 
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Example 2. 


2 = f(x) , =v, 
Y= {v «g(y)} 
f= V5 . 3, = 80) 


Yo = {roth(x)} 
E,-unify(f(x), v ,) returns the single unifier 

p = {v,ef(x)}. 
Map-unity(y,, p) call CR-unify on: 

t' = g(y) s' = f(x) 


which is the original problem. 


The procedure will similarly loop on the second recursive call that starts with E,-unifying f and fe 


Example 3 is another example in which a unifier exists, but the CR-unify algorithm will fail to find it. 


In this example case C of CAR-variable-unify is the incomplete step. 


Example 3. 
E,:f(0, x) = x t = f(x, g(y,2z)) S=2 
Esetiah= x 


This example fails in case C of CR-variable-unify, because z occurs below an inhomogeneous sub- 


term of s, although the substitution {x «0, y«2} is a unifier of t and s. 


Again, if we replace case C of CA-variable-unify with a recursive step as in case B of 


CR-variable-unify and case 5 of CR-unify, the procedure will loop. 


Example 4. 

f= f(x, v,) $=2 
y = {v,*8(,z)} 

E, -unify(f(x, v,), 2) returns a single unifier 
p a {2ef(x, v ,)}- 

Map-unify(y, p) will call CR-unify on: 
th =v, S, = g(y,z) and 
tp = f(x, v,) SZ, 

CR-unify(v 4°80’, z)) returns a single unifier: 
o, = {v,+8, z)}. 

Applying this to the second pair of terms gives: 
o,f, = f(x,2(,2)) oS, = 2, 

which is the original problem. 


The unbeation problem in Examples 3 and 4 are correctly solved by replacing x with a constant 
or by E-matching : by « aS suggested by [Tiden 85]. However, Example 5 shows that these methods 


do not work in general. 
Example 5. 
E.:t(y, x)= x t= h(t(y,y)) S=Xx 


f= h(v,) S=¢, 


where c , is anew constant. 


The terms /: (vy a) and ¢, are not E,-unifiable. so this procedure will fail to find the unifier, {x ¢h (v,), 


yeh(v,)}. 


Intuitively, the problem illustrated in Examples 4 and 5 is that the evidence of z int is lost when t¢ is 
homogenized. If. instead of replacing »(:, z) with a new variable, we replace it with z, then the 
algorithm finds the correct unifier. Unfortunately, this technique will not work in general for non- 
regular theories as shown in Example 6. Like Example 5. Example 6 is also a counter-example to the 
idea of replacing a variable to be unified with a constant. In Example 5 the theory is regular by not 


confined. and in Exampie 6 the theory is confined but not regular. 


Example 6. 
E,: f(x, 0) = 0 t = A(f(x,»)) s=y 
E,: © (h unconstrained) 


t= h(y) s=y 
where t' is the homogenized form of t, but } is used in place of a 
new variable because it appears in f(x, 1) and itis the term with 
which t is being unified. 


Unifying 4()') and y in the empty theory will fail, so the E-unifier, {) +0}, will not be found. 


Examples corresponding to 3 and 4 also exist for non-regular theories. The unification problem in 
Example 6 shows that case C of CR-variable-unify is not complete, and that modifying case C to 


perform recursion will make the process loop. 


The examples themselves are interesting, but a general conclusion can also be drawn, namely, 
that the properties of completeness and termination conflict. These examples also motivate the 
careful proofs in Chapter 3, since some of the completeness problems would not arise in the intuitive 


arguments of the lemmas, but would arise in the careful inductive proofs of the theorems. 


: We also note that the trivial failure cases in classical unification, ia., clashes and cycles, are no ae 
longer failure cases in many equational unification aigorithme. ‘While the relaxation of clash detection 
has been treated in the literature, the difficulty of detecting cycles in theories such as idempotence, 
where unifying f(x, x) and x does not cause-a cycle, has been underestimated. 
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Chapter Three 


Proof of Total Correctness 


This chapter presents the proof of total correctness for the CR-unify procedure. The correctness 
proof is divided into a consistency theorem, a completeness theorem, and a termination theorem. 
The consistency and completeness theorems, given in Sections 3.2 and 3.3, respectively, are proved 
by induction on the depth of recursion and are therefore dependent on the termination theorem in 
Section 3.4. The proof of termination is by induction on a noetherian ordering on pairs of input terms; 
it is a generalization of Fages’ proof of termination for AC-unification [Fages 84]. Before presenting 
the three correctness results, we will begin in Section 3.1 with some definitions for the proofs and 


some important theorems having to do with our restriction to confined regular theories. 


The correctness proofs depend on CR-unify being protective. For the sake of thoroughness, we 
present a more careful description of the naming of new variables in Appendix A, along with a 
discussion of protection. Throughout this chapter we will assume each new variable is one that has 
not occurred previously and thus, for example, the new variables used in forming f and $ in CR-unify 
are disjoint from new variables generated within an invocation of a sub-theory unification algorithm. 
The formal discussion of protection and naming of new variables is relegated to an appendix because 


it is more technical than interesting. 


3.1 An Overview 


Throughout this chapter we will assume a given set of equations, E, defining a strictly consistent 
equational theory. E* is partitioned by 7 = {E,, Eanes E}, which also defines a partition on the 
signature, F, of E*. For each sub-theory there exists a complete, consistent, protective, terminating 
unification algorithm. We make no other global assumptions on E* and will be careful to point out 


cases in which E* is assumed to be confined or regular. 


Each of the three main theorems, consistency, completeness, and termination, is proved by induc- 
tion. We could imagine combining the three proofs into a single inductive proof of total correctness. 
This is not necessary because the termination proof does not depend on the recursive calls returning 


complete sets of unifiers, but depends instead only on the termination of the recursive call and some 
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technical sreperties cf the returned substitutions. However. the proposed single correctness proof 


will provide a useful structure for demonstrating the overall structure of the three proofs. 


First. at each step Some progress must be made towards finding an answer so that the next 
unification problem to be solved is, by some measure. easier than the current one; this property, 
along with the fact that our measure of complexity does not decrease infinitely. implies termination. 
Second, each step must generates some piece of what could turn out to be a legitimate unifier, ie., 
we cannot generate any incorrect pieces. This is the criterion for consistency. Finally, every pos- 
sibility leading to a good answer must be considered. The set of partial answers generated at each 
step must be complete in the sense that every mast general unifier can be formed from one of the 
elements of the set. This property will give us completeness. By separating these three independent 
properties into three theorems. we are able to focus our attention on one of the correctness 


properties at a time. 


In proving the inductive hypotheses in the consistency and completeness proof, we need to show 
that the basic approach to breaking down the problem and building up the solutions is correct. 
Intuitively. we would like to show the diagram of Figure 3-1 is correct. Although we will not prove this 
diagram commutes. it will help motivate the technical lemmas in the proof of completeness, and is 
very close to the correctness diagram that will be proved. There are two levels of detail at which the 
diagram should be viewed. Consider first only the mappings, i.e., the labels on edges, and note that 
the right and left halves are mirror images, each showing o°p a Now consider the labels on 
vertices. fand § are homogeneous terms and t and s are some possibly inhomogeneous instances of 


these terms, p is an E,-unifier of fand S, and o is an E-unifier of t and s. 


Figure 3-1: Diagram Exemplifying Correctness Properties 


The consistency argument says roughly that if y is the preserving substitution for f, t, and §, s, 
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then for any sub-theory unifier, p, of the homogeneous terms, all E-unifiers of p and y are E-unifiers of 
t and s. The completeness argument says roughly that for any E-unifier, o, there exists some sub- 


theory unifier, p, of the homogeneous terms, such that o is an E-unifier of p and y. 


3.1.1 Definitions for Terms in E* 


Because E* is partitioned, the terms in J/ = have some interesting properties related to the par- 
E 


titioning. This section establishes some new concepts for describing these properties. 


There is a partial order on terms defined as t-<s if and only if t is a subterm of s. We use this to 
define another partial order on terms that is contained in ~< but takes into account the equivalence 


classes of function symbols defined by 7. 


If Oo is a proper occurrence, let prefix(o) be the string o minus the fast number, ie, 
for all oxe, there exists i such that o = prefix(o).i. In the tree representation of terms, prefix(o) 


indexes the parent of the node at occurrence o. 


Definition. Given a partition, =, on F, a term, t, and an occurrence, o € O(t), o is said to be 
significant in t if and only if dither. 

1.0 = e,in which case t/o =f, 

2. or, 0 is not strict in ¢, i.e., t/o is a variable, 


3. or, (t/0).head# (t/prefix(o)).head. 
v7 


In other words, an occurrence in a term t, is significant if the subterm at the occurrence has a head 
symbol in a different partition of F than the symbol it occurs under. In addition, the empty occurrence 


and all variable occurrences in a term are significant. 


Definition. The term s is a significant subterm of t, denoted SSyt, if and only if: 

4 o€ O(t) such that t/o = s and ois a significant occurrence int. 
Note that s may appear at both significant and insignificant occurrences within ¢, but if at least one 
occurrence is significant, then s is significant in t. If s is proper in t as well as significant, we write 


S~<yl. 


Let F be partitioned in to {+}, {*, a}, {*} and {b, f, g} as in Figure 2-1. Figure 3-2 shows a term 
with all of its significant subterms outlined. As noted, a is significant in its first and third occurrences, 


but not in its second occurrence. 


A suggestion of the relevance of significant subterms comes from considering an arbitrary term, t¢, 


i 
c 


Figure 3-2: A Term and its Significant Subterms 


its homogeneous form, f, and the preserving substitution, y =Preserve(t, f). All terms in ®(y) are 
significant in t. The only significant subterms of the homogeneous term, f, are f itself and the 


variables in the f. 


Next we define the notion of the parents of aterm t in s, denoted Parents(t, s), where a parent is an 
operator symbol in s having t as an argument. A special null operator, n, is included denote the 


parent of a term within itself. 


Definition. The parents of tin s, written Parents(t, s) are: 
1. ift=s, then Parents(t,s) = {n}, 
2. otherwise, Parents(t,s) = {f| If (S418, 28s & (Fi, s; = 1}. 


This definition is extended to equivalence classes of F by considering only those classes with a 


representative in Parents(t, s): 
Definition. The parent sets of t in s, written ParSets(t, s) are: 
1. If t=s, then ParSets(t,s) = {{n}}, 
2. otherwise ParSets(t, s) = {[/]|/€Parents(t,s)}. 
Taking t to be the term in Figure 3-2 and using the same partitioning on F ({ +}, {*, a}, {*} and {4, f, 
2 }), we get the following values of parents and parent sets in t. 


Parents(x, t) = {+} ParSets(x,t)={{*, a}} 


Parenista the (7 9} ParSets(u.t)={{*. a}, £2. f, g}} 


Parents(y ea. Y= {x} ParSets(v su. th={{b,f, e}} 
Parenis(:. th={} ParSets(:. th={Q}} 
Parentstt. th= {ni} ParSets(t, th= {{u}} 


3.1.2 Properties of Confined and Regular Theories 


As stated in Section 2.1.3. we will limit the set of theories in E* by two syntactic restrictions: 
confinement and regularity. In this section we give some of the lemmas pertaining to confined and 
regular theories. characterizing the ways in which unification is simplified in these theories. For both 
confinement and regularity, we first prove that the restriction on the axioms is equivalent to the same 


restriction on the theory. 


3.1.2.1 Confined Theories 

Recail the definition of confined theories limits the equational theories to those containing no 
equations with a variable equal to a non-variable. Lemma 1 shows that we eliminate exactly those 
theories that are unconfined by eliminating those with unconfined presentations. Lemma 2 will show 
that confined theories contain only equations with head symbols constrained by the same E. and 


Theorem | relates this property to the unification problem. 


Lemma 1 and a number of other proofs involving equational theories will be done by induction on 
the length of proof. By the completeness of the inference rules in Section 1.4.2, we know that if f = 
then there exists a finite proof of Les: starting from the axioms in E and using only the five listed 
inference rules. We use this fact in the proof of Lemma 1, for example, where we want to show there 
are no equations in E* that are unconfined if there are no equations in E that are unconfined. To 
show any equation ies is confined, we need to show that the inference rules on!y prove confined 
equations, when they start with confined axioms. If we measure the proof as the number of steps, 
where each step requires application of one rule of inference, then the basis case is a proof of length 


zero. If the equation is proved without any inference rules, then it must be in the set of axioms. 


The inductive step proves that a proof of length n+1 gives only confined equations. assuming 
proofs of length n give only confined equations. Proving the induction step is done by examining 
each possible inference rule that could take us from the n'" to the n+ 15 step. If the transitive rule 
was the last step in a proof of tes, then there must have been some other term, r, such that ft ru and 
bes were proved in n steps or less. We can therefore assume the inductive hypothesis on eet and 
r ai which says both equations are confined. Combining this with the fact that the term r occurs in 
both equations, we know that either t, s, and r are all variables, or f, s, and r are all non-variables; in 
either case the equation f = will be confined, and the induction step for transitivity is proved. 
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Lemma 1: E is confined if and only if E* is confined. 
Proof. One direction (=) is obvious, the other will be proved by induction on the length of proof in 
E*. Let E be confined and show that any equation, tes, in E* must be confined. 


1. Basis: tes is an axiom in E, sot = is confined because E is confined. 


2. Inductive step: If t = has a proof of length n, then consider each possible inference rule for 


the last step in the proof: 


A. Reflexive: Trivial. 
B. Symmetric and transitive: Follow directly from the induction hypothesis. 
C. Equality: Yields only pairs of non-variable terms. 


D. instantiation: From t' =s deduce ot’ =o, where tis ot’ ands is os’ and t’ ss" has a proof 
of length n-1 or less. By the induction hypothesis, either t' and s' are both non-variables 
or both variables. If they are both non-variables, t and s will be also. If they are both 
variables, then they are the same variable (by consistency of E*), so t and s are either 


both the same variable or both the same non-variable term. 


QO 


One condition used in our proof of correctness of CR-unify is that E* is a confined theory. Lemma 
2 shows that in confined theories there are no equations whose right and left head symbols are 
constrained by different sub-theories. This gives us the invariant, stated in Theorem |, that two 
non-variable terms whose heads symbols are constrained by different sub-theories (from 7) of E* are 
not E-unifiable. We make use of this fact in case 4 of CR-unify where an empty set of unifiers is 


returned. 


Lemma 2: If E* is a confined theory, then t ng => t.head =s.head. 
T 


Proof. By definition, 7 presents the same theory as E, although the two may contain different 
axioms. We will make use of this fact by performing induction on the length of proof that is starting 


from the axioms in 7, rather than the axioms in E. 


1. Basis: If tes is an axiom in some E€m, then because a defines the partition on F, 


t.head =s.head. 
T 


2. Inductive step: Consider each possible inference rule for the last step in the proof. 


A. Reflexive: Trivial. 


B. Symmetric and Transitive: Follow directly from the induction hypothesis. 
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C. Equality: Yields only terms with the same head symbol. 


D. Instantiation: From t’ deduce at’ =o, where ot’ =f and as’ =s, 


i) If t’ and s’ are non-variable terms, then by the induction hypothesis t'.head =s'.head 
7 


and hence t.head =s.head. 
T 


ii) If either t' or s’ is a variable, then they are both variables and both the same variable. 
Therefore t and s are the same term and the property holds trivially. 
Oo 


Theorem I: {if E* is a confined equational theory, and t and s are non-variable terms such that 
t.head# s.head, thent and s are not E-unifiable. 

7 
Proof. Any instance of t and s will have the same non-equivalent head symbols, and by Lemma 


2 these two terms cannot be equal in E*. QO 


3.1.2.2 Regular Theories 

The restriction to regular theories is needed for case C of CR-variable-unify, a failure case. l.e., 
because C is a failure case, CR-unify is not complete for all confined theories, but regularity along 
with confinement of E* is sufficient for completeness. (See Section 2.4 for an example showing 
incompleteness of CR-unify for unconfined theory and a non-regular theory.) The key result of this 
section is Theorem Il. It states that a variable is not unifiable with a term, t, containing that variable if 
the variable occurs below the homogeneous part of the term, i.e., at an occurrence not in f. Lemma 


4is used to prove Lemma 5 which in turn is used to prove Lemma 6. 


Lemma 3: E is regular if and only if E* is regular. 


Proof. Again, one direction (<=) is obvious, the other will be proved by induction on the length of 
proof in E*. Let E be regular and show that any equation, ¢ =s, in E* must be regular. 


1. Basis: = as an axiom in E, sot = is regular by the hypothesis on E. 


2. Inductive step: Consider each possible inference rule for the last step in the proof: 


A. Reflexive: Trivial. 
B. Symmetric and transitive: Follow directly from the induction hypothesis. 


C. Equality: From t; 55: 1<i<n, deduce f(t,...t,) S18 yr-8,)s where t is F(tyvot,) and s is 
f(S4.058,)- Since t contains the union of all variables in the t's and s contains those in 
the S'S, these two sets are equal because they are each the union of n pairwise equal 


sets. 


D. instantiation: From t’=s' deduce ot re where t is at’ and s is os. By the induction 


E 
hypothesis. itt)= 1s). so consider each v in this set. If » is in the domain of o, then 
the variables in oy will occur in both tf and s and otherwise vy itself will occur in both ¢ 


and s. 


The property gained by restricting E* to a regular theary can be seen by considering the set of all 


significant subterms of a pair of equivalent terms. Lemmas 4 and 5 characterize this property 


precisely: Lemma 4 states that any variable occurring in two equivalent terms, will occur under the 


same set of parent sets, i.e.. equivalence classes of parents; Lemma 5 shows that given two con- 


gruent terms, the sets of all significant subterms of the terms are equal modulo E. 


Lemma 4: If E* is confined and regular. and use then for all y €V, ParSets(y, t) = ParSets(v, $s). 
Proof. By induction on the length of proof of t = starting from the axioms in 7. 
1. Basis: If = is an axiom in some E.€q, then t and s are homogeneous in the constrained 
symbols of E*. and there are three cases: 


A. If + €i(t) then by regularity of E. vy €7(s), so ParSets(y, t) = ParSets(y,s) = ©. 


B. 1 €71t) and t is a non-variable term, then s is a non-variable term and by regularity of E*, 
y€V{s). Therefore, ParSets(v, t) = ParSets(v, s), since equations in @ are by definition 


homogeneous. 


C. If t is a variable and vy =?t, then s is the same variable. Therefore, ParSets(y, t) = 


ParSets(v,s) = {{}}. 


2. Inductive step: Consider each possible inference rule for the last step in the proof: use one 


of the following inference rules. 


A. Reflexive, symmetric, and transitive: Obvious. 


B. Equality: From t, aS, 1Si<n, deduce f(t,,...t,) Ef (Syr--8p)h where tis f(t,,...,1,) and s is 
F(S40458,)- If t, and s; are non-variables, then by the induction hypothesis, all variables in 
t will occur under the same parent sets in s, and vice versa. If t, and sare variables, 


then they are the same variable so both will occur under the equivalence class, [/]. 


C. instantiation: From ¢’ =s' deduce at’ =0s", where ot’ =t and os’ =s. Consider the follow- 
ing two cases on the occurrence of v in ot’. 

i) If the occurrence of v in ot’ is also in t’, then by the induction hypothesis there is 
some occurrence in s’ under the same parent set. The same is true with t’ and s’ 
reversed. 
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ii) If the occurrence of » is in ot’ and not int’, then v must be an element of 5(a) or, 
restated, there exists v'€S(a) such that » € Y(ov') and yv‘€t'. By the induction 
hypothesis, ParSets(\’, t') = ParSets(v', s’), and if ov’ is simply the variable, y, then 
all occurrences of » will correspond to an occurrence of v' inbothtands. Ifev'isa 
non-variable term containing v, then by regularity, the entire subterm (ov) will occur 
in both t and s and thus vy will occur under the same operators in both. 


O 


Lemma 5 states that if tes, then there is a one-to-one correspondence modulo E between the 
significant subterms of t and the significant subterms of s. The main purpose of Lemma 5 is to prove 


Lemma 6, although the result is interesting in its own right as a property of confined regular theories. 


Lemma 5: If E* is confined and regular, and ra then for all t'<yt there exists s'<ys such that 
t'=s'. 
E 
Proof. By induction on the length of proof that t a starting from the axioms in 7. 
1. Basis: If ts is in some E.€q, then ft and s are homogeneous with respect to the symbols 
constrained by E*. Therefore, the only significant subterms int are ft itself and the variables 
in tt). If t' =t, we can take s’ to be s, and if t' is a variable in t, we can take s’ to be the same 


variable, which must an element of 7(s) by regularity of E*. 


2. Inductive step: Consider each possible inference rule for the last step in the proof. 


A. Reflexive, symmetric, and transitive: Obvious. 


B. Equality: From t, =S) 1 Sign, deduce f(t,,...t,) S(Sy1-08,); where t is f(t,,...,.f,) and s is 
f(Sj1-48,). By the induction hypothesis, for every t<yt, there exists s~<vs. such that 
tes Furthermore, by Lemma 2, t..head = s..head, so t, will be significant in t if and only 

T 


ifs, is significant in s. 


C. Instantiation: From ie eS, deduce at, £98) where t is ot, and s is os,. Consider t'<yt, 


such that ot,/o = t’. 


i) If ois a strict occurrence of O(t,), then by the induction hypothesis on n, there exists 
o'€0(s,) such that t,/o =8,/0. Applying o to these two equivalent subterms we get 


at,/0 : as,/0', but of,/o is t'so we can take s’ to be as,/0'. 


iil) lf o is a variable occurrence, then t,/o is variable, call it v. By Lemma 4, 
ParSets(v,t,) = ParSets(v, s,). Therefore, t’ will itself occur significantly in s if and 


only if it does in t. 


iii) If o€ O(1,), then ft is a proper significant subterm of ov for some y€(a). By 
regularity of E*, vet) => vEUs,), so oy will occur in S=0S,, and hence ?’ will 
occur significantly in s. 

O 


The above lemmas give some general properties about confined regular theories. More specific 
to our purposes is the following lemma, which gives a sufficient condition under which terms are not 
equal in the theory.. Lemma 6 states that no term is equal modulo E to any subterm of a proper 
significant subterm of itself and is proved by induction on the structure of terms, i.e., using the 


subterm ordering. Theorem Il relates the equality problem back to the the unification problem. 


Lemma 6: If E* is confined and regular, and t, s, and r are terms such that t<r-~<ys, then ha 
Proof. By structural induction on s. 


1. Basis: If s is a variable or constant, then Ar<gs, so the hypothesis is vacuously true. 


2. Inductive step: If s is a non-variable, non-constant term, then there are three cases to 
consider with respect tot. 


A. lf tis a variable, then tes by the confinement property of E*. 
B. lf tis anon-variable term such that t.head#s.head then ies by Lemma 2. 
7 


C. lf t is a non-variable term such that t.head =s.head, then assume i and derive a 
7 


contradiction. 


ds’ such that t<s'<ws and s'.head#s.head. (The existence of the significant subterm, 
7 


r, between ft and s implies the existence of s’ with inequivalent head.) 
Since t.head#s'.head, t<s'<ys. 
By Lemma 5, t as and s'~<ys implies there is some t’~<yt such that f 5s. - 
From Lemma 2, t’.head =s.head. 
Therefore, t'.head#t.head, and, again, t’ must be is proper int. 
So far we have t'<yt~<s'<ys. 
Apply the induction hypothesis to s’, Asing t' fort, s’ fors’ and t’ for r, respectively. 


By the induction hypothesis, es’ but this is a contradiction, since t' was chosen such 


that t'=s'. 
E 
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The main result for confined regular theories is in Theorem Il, which shows that case C of 


CR-varabie u7iy should be a failure case. 


Theorem tl: !f E* is confined and regular and r and s are non-variabie terms such that r~<ys and 
v€¥{r). then v and s are not unifiable. 
Proof. Ifo is unifier of » and s, then: 
oy =os 
E 
and, furthermore. the following property holds: 
OV Sor~<yl. 


This contradicts Lemma 6. so no such o can exist. O 


Note that the existence of the significant subterm, r, in both Lemma 6 and Theorem Il is necessary. 
Without r the Theorem would state that no variable is unifiable (in any confined regular theory) with a 
term that contains it. A simple counter-example to this stronger statement is the pair of terms f(x) 
and « in the theory presented by E = {f(.) ea}. the terms are unifiable though E* is confined and 


regular. 


The important results of this section are Theorems | and Il. Theorem | will be used to show the 
completeness of case 4 of CAR-unify. where the empty set of unifiers is returned for two non-variable 
terms with heads constrained by different sub-theories. Theorem II will be used to show the com- 
pleteness of case C of CR-variable-unify, where the empty set of unifiers is returned for one variable 
and one non-variable term when the variable occurs in the non-variable term below its top 


homogeneous part. 


3.2 Consistency 


This section presents the proof of consistency for CR-unify. The key lemma for consistency, 
Lemma 7, can be explained informally as showing that any substitution which unifies a sub-theory 
unifier, p, with the preserving substitution, y, will also be a unifier of the two terms, t and s (where the 
notation here is that of the CR-unify procedure). This lemma alone is not enough to show consis- 
tency, since the consistency of CR-unify depends on the consistency of map-unity, which in turn 
depends on the consistency of CH-unify. Therefore, we will use an induction on the depth of recur- 


sion for proving the consistency of CR-unify in Theorem lil. 


Lemma 7: Let F. be the set of constrained symbols for E€a and let t and s be homogeneous terms in 


F.. If y, p, and o are substitutions then: 
t= ps&o°p=a°y => t)=a(ys). 
p z? p=o°y oly) =aly ) 
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Proof. 


pt = ps => pt = PS, since E; is a sub-theory of E 
= a(pt) E o(ps), by applying the substitution, o 
=> (o°p)t = (orp), from the definition of composition 
=> (o°y)t = (o°y)s, substituting o°y for o°p 
= o(yt) = o(ys) 


0 


The consistency theorem is given below. The proof is done on a case-by-case basis, where the 
cases are those appearing with labels in CR-unify algorithm. The basis cases are those in which no 
recursion is done and the inductive steps are those that involve recursion. Theorem Ill uses Lemma 


7 in proving the inductive steps. 


Theorem Ill: If o is a substitution produced by CR-unify(t, s), then o is an E-unifier of t and s. 
Proof. By induction on the level of recursion of CA-unify. 


1. Basis: Cases 1 and 4 of the CR-unify procedure and cases A and C of CA-variable-unify, as 


called in cases 2 and 3 of the CR-unify procedure: 


A. Case 1: ft and s are both variables, so {t « s} is a unifier. 

B. Case 4: Returns the empty set, so the theorem holds vacuously. 

C. Case A: v does not occur in s, so the single returned substitution, {v « s}, is a unifier. 
D. Case C: Returns the empty set, so the theorem holds vacuously. 


2. Inductive step: Case 5 of CR-unify and case B of CR-variable-unify are the inductive steps. 


A. Case 5: t and s are non-variable terms, fand § are their respective homogeneous forms, 


and y is a preserving substitution for both t, fand s, &. 
By consistency of each of the E.-unify procedures, each p€P is a unifier of fand §. 
Choose a value for p. 


Each o€2 is generated by map-unify(y, p) for some p, and is therefore of the form 
@°...2W,, where w ECR-unily(w, pv, @;4Y;): By the induction hypothesis, each 


value of w; is a unifier of w, ,pv, and w, ,yv,, So a is an E-unifier of each correspond- 
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ing pair of terms in the range of p and y and is therefore a unifier of p and y 


themselves. 


y p.and a. meet the conditions of Lemma 7 with f and § for t and s in the lemma and t 


and s for yt and ys in the lemma, so o is a unifier of tands. 


Therefore, 21471)U4s) is also a unifier of t and s. 


B. Case B: : is a variable, s is non-variable term, § is the homogeneous form of s and y is a 


preserving substitution for s. §. 


By the assumption of consistency of the E.-unify procedures, each p€P is a unifier of v 


and §. 


By the same argument as in case 5, each o€Z is an E-unifier of vy and s in 


CR-variable-unify, which is exactly the pair of terms t,s ors, t of CR-unify. 


Therefore. oly, }U is) 'S a unifier of t and s of CR-unify. 


3.3 Completeness 


In this section we will prove that CR-unify is complete for ail confined regular theories. The proof 
of the completeness theorem, Theorem iV, is done by induction on the depth of recursion, and its 
structure is similar to the proof of consistency, Theorem ill. The proof of Theorem IV is given in 
Section 3.3.3, and uses a number of lemmas developed in Sections 3.3.1 and 3.3.2. The main lemmas 
are Lemmas 12 and 13 in Section 3.3.2, the proofs of which rely on the technical definitions and 
lemmas of Section 3.3.1. The proof of Theorem IV will also use Theorems | and il, since each gives an 
independent set of sufficient conditions for completeness of a failure case. Section 3.3.1 gives some 
technical definitions of functions that are used in the proof of completeness but are not needed in the 


implementation 


3.3.1 A New Homogenizing Operation 


Our proof of completeness uses a function, U-Homog, for forming homogeneous terms. The 
homogenizing operation in Section 2.1.2.1 is not unique for a given input, but may vary in the names 
of new variables. The function defined here will use a special set of variables and have an inverse 


mapping which is a substitution; both the variable set and the inverse substitution are universally 


defined for E*. In addition, we will extend U-Homog to a function on substitutions. The functions 
defined here are used solely as aids in the proof of completeness and are used in the implementation 


of CR-unify. 


The special set of variables used in forming homogeneous terms will be denoted by U. Each 
variable in U represents an equivalence class of terms in the theory, E*. There is one for each 
element of quotient algebra, T(F, V-U)/ a We will represent each variable in U as Wig where ¢ is 
some term containing no variables in U, and [f] represents the equivalence class of which ¢ is a 
member. By definition, we know ies => u ae Wisp i.e., u tt] and Wg] are two denotations for the same 
variable. Henceforth, we will assume the existence of this set, U, as it is universally defined for E*, 


and denote V-U, the complement of U, by 7U. 


3.3.1.1 Homogeneity Using U 
The function, U-Homog, is similar to Homog except that each maximal subterm whose head is not 
in the set F will be replaced with an element of U rather than with an arbitrary new variable. The 


following definition of U-Homog differs from the definition of Homog only in case 3. 


Definition. Let F be a set of function symbols and t be a term containing no variables in U. 


U-Homog(t, F) is defined as follows: 


1. If tis a variable, then U-Homog(t, F) = t. 


2.|ft = f (tyreat,) and f € F, then 
U-Homog(t, F) = f(U-Homog(t,, F),....U-Homog(t., F)). 


3. Ift = Ftyvt ht € F then U-Homog(t, F) = Mig 


We extend the notion of homogeneity and the homogenizing function to substitutions. A substitu- 
tion, a, is homogeneous with respect to a set of function symbols, F, if and only if oe F(t)}CF. We 
t€(a) 


define a function U-HomogMap on substitutions, which is analogous to U-Homog on terms. 


Definition. Let o be a substitution containing no variables from U (i.e., [B(a)UI(a)]MU = SB) and let 
F be a set of function symbols. Then U-HomogMap(o, F) is a substitution such that 
%)(U-HomogMap(a, F)) C D(a) UU and for all vE D(o)UU: 


1. If vED(a), then 
U-HomogMap(o,F)v = U-Homog({evy, F). 


2. If y €U, then by the definition of U, v = u,,, forsome t (where Wt)NU = SB), and 


It] 


U-HomogMap(o, F)y = Wot} 


U-HomogMap is well-defined because t2s => of =a8, so picking an arbitrary ! from [t] will result in a 
unique equivalence class, [ot], and thus a unique variable, u [ot] The domain of substitutions formed 
by U-HomogMap may be infinite unlike other substitutions we have used thus far. Extending substitu- 
tions in this manner gives no additional most general unifiers, because any term has only a finite 
number of variables. When the value of F is clear from context, we will use f and o to denote 


U-Homog(t, F) and U-HomogMap(a, F), respectively. 


As noted, the definition of f and f are identical except in the names of variables used to replace 
subterms. Furthermore, if F is [t.head] as in f, ft is an instance of f for any t, since f uses different 
variables for each replaced subterm whereas f will use the same variable more than once if two 
replaced subterms are equal modulo E. Therefore, we can relate fto f by finding the match of f by f. 
Furthermore, observe that this is the homogeneous form (with respect to F) of the preserving substitu- 


tion for t and f. 


Proposition 1: If F = [t.head] and y = Preserve(t, f), then 
yf=t. 


The definition of U-Homog may, at this point, seem somewhat under-motivated. The following 
lemmas describe some useful properties of U-Homog that will be used in the completeness lemmas 
and are not valid for Homog. The two functions, Homog of the implementation and U-Homog of the 
lemmas, are related in the completeness theorem using Proposition 1. Lemma 8 shows that U-Homog 
commutes with the application of substitutions to terms, and Lemma 9 extends this property to com- 
position of two substitutions. Lemma 10 is the key result of these lemmas; it justifies the division of 
the unification problem in E* into unification problems in the sub-theories by showing that the exist- 
ence of a particular equation in E* implies the existence of the homogeneous form of the same 
equation in the sub-theories. Henceforth, we will assume that no substitution or term contains a 


variable from U, unless formed from U-Homog or U-HomogMap. 


Lemma 8: For any set of function symbols, F: 
o tof. 
Proof. By induction on the structure of t. 
1. Basis: t is a variable, call it vy. 
A. If v€S(o) then 
of=aVv=av 


= ov, by the definition of U-HomogMap. 


Ni 
Q 
iN} 
Q 


2. Inductive step: f = f(t,.....f,). for some f €F of arity n2 0. 


A. If f €F, then: 


ote Of (ty st) 


= o f(t... 1), because fis in F 

aot); by homomorphism of substitutions 
at). by the induction hypothesis 
f(at,.....07,). from the definition of U-Homog 


ti) by homomorphism of substitutions 


B. If f €F. then: 


= of (t,t) 


at 
~! 


ou tt because / is not in F 


W rt} since WU 


at, since the head symbol of at is still f, and therefore not in F 


0 


Lemma 9 extends the commutativity of U-HomogMap with substitution application to the com- 


mutativity of U-HomogMap with substitution composition. The proof is straightforward. 


Lemma 9: If F is any set of function symbols, then: 


Proof. ShowV v€ Vo,°o,) = o 


. 4° ov . There are two cases: 


1. If v€U, then v =w,,, for some term ¢, then: 


{t] 
O5°O oun = *U(o,20,)) ~ “fo (ont) ~ O44 foot = 5 4(F54 14) 
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2. If y EU: 
0,°0,V = 0,°0,) = 0,°C,V = a ,(o,v) 
= 0 ,(o,v) 7 0 ,(0,¥) = 0,°O,V 
= 0.°o 
9 ,°O,V 


O 


Lemma 10 is the key to our completeness argument. Unlike Lemmas 8 and 9, which do not 


assume any relationship between F and 7, Lemma 10 will take F to be a set of function symbols 


constrained by one of the sub-theories. It states that the homogeneous forms (by U-Homog) of any 


two E-equal terms are E,-equal in the sub-theory constraining F. It is important that F is the set of 


constrained symbols for the sub-theory, E*, but we do not make any assumptions about head symbols 


of t and s belonging to F. 


Lemma 10: If F is the set of constrained symbols for some sub-theory, E* of E*, and E* is confined, 


then: 
t=s =>T=S. 
E E 
Proof. By induction on the length of proof of t a starting from the axioms in 7. 


1. Basis: If t = is an axiom in some Een, then: 


A. If E = E,, ie., tes € E., then t and s must be homogeneous in F. Therefore, f=t, $=S, 


andt=s€ E,, so there is a proofin E,. 


B. If E*E,, Le, iss ¢ E. then assume, since E* is confined, that both t and s are non- 
variables. Furthermore, by disjointness of function symbols in elements of 7 we know 
[F(t}UF(s)JNF = S. Thus, t= Win and s= Wr) and since tes, it follows from the defini- 


tion of U-Homog that u tt] and u [s] are identical and thus u tt] z” [s] by reflexivity. 


2. Inductive step: Consider each possible inference rule for the last step in the proof. 


A. Reflexive, symmetric and transitive rules: Obvious. 


B. Equality: From t, =5) 1Sis¢n, deduce Ff (tysot) FS y-8,)) where t is f(t,,....f,) and s is 


L (845-58): By the induction hypothesis, f : S; for 1 <i<n. There are two cases on f: 
i 


i) If f €F, then t= f(t,,...f,) and S=f (S,,..8,), 80 t= Sis implied by the equality rule. 


E, 
i 


ii) If f €F, then t= Wig and § = Wisy Since tes, Wa and Wr are identical, and thus es 


by reflexivity. 
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C. Instantiation: From ft’ zs deduce ot! =0s), where t is ot’ and s is os. By the inductive 
hypothesis we know tee and applying the substitution o to this equation we get 
of = a8", which by by Lemma 8 implies af” = os. But, of’ is tf, and os’ is s, so this gives 


i 
O 


Note that the assumption that E* is confined is necessary. Without this assumption, case B in the 
basis case would not hold. Consider an equation, for example, v =f, where t.head¢€F. In this case, 


v=vand f= Wap but v and Uj, are different variables and are therefore not equal in E*. 


While intuitively Lemma 10 is the key to the completeness argument, we must still relate the 
equality problem back to the unification problem and relate the U-Homog forms of terms in the lemma 


to the Homog forms in the algorithm. 


3.3.1.2 The inverse Substitution 

For the homogenizing function Homog, we were able to define the notion of a preserving substitu- 
tion, Preserve(t, f), which mapped a homogeneous form of a term back to the original term. For 
U-Homog, such a substitution cannot be defined, since two subterms may be different terms, but 
equal modulo E, and will thus be replaced by the same variable. We define instead the universal 
E-preserving substitution, p, which maps each variable in U to some element of the equivalence class 


of terms it represents. 


Definition. Let » be a substitution such that B() = U and Vu eu: BU y= t’, such that ¢’ st. 


The choice of which term in the equivalence class to use is not important but only serves to take 
us from an element of U back to the set of terms in which we are working, terms that do not contains 
variables from U. The axiom of choice guarantees the existence of such a substitution. By construc- 


tion of » and the homomorphism of substitutions, we now have the following property: 


Proposition 2: For any term, t, and any set of function symbols, F: 


u(t) at. 


We would like to extend this property directly to substitutions and get po ao but because the 
domain of p contains all variables in U, composing p with the homogeneous form of a substitution 
yields a substitution with more variables in its domain than the original substitution. Therefore, the 


extension to substitutions is the following lemma: 
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Lemma 11: Let F be any set of function symbols. 


peo = oop. 


Proof. Show V: €V. (ou) + = (w°a)y. Consider the following cases on y : 


E 
1.1fy € Z(o), then: 


peo y = pov) 
= ov. by Proposition 2 
= o°pY, because 1 €U 
2. If € U, then it is of the form u cy for some term, t, and we have: 
POO Hig = HUM cy) 
=ot 
E 
=o (jeu ir) 
eed 


3. If y € UUD(a), then poo v=rve= oon v 


O 


The property proved in Lemma 11 is expressed as a diagram in Figure 3-3. This diagram will 


appear as a basic component in Figure 3-4, which illustrates the main completeness lemma, Lemma 


13. Figure 3-4 will bring us very close to the preliminary correctness diagram of Figure 3-1. 


Figure 3-3: Commuting Diagram for the Universal E-preserving Substitution 
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3.3.2 The Completeness Lemmas 


Having built up some background lemmas to use in proofs, we are now ready to prove the main 
results leading to completeness. The first lemma, Lemma 12 shows that the decomposition process is 
complete, that every E-unifier is made up only of pieces of E.-unifiers. The second lemma, Lemma 13, 
shows that the combination process of unifying substitutions is complete, that all the necessary 


combinations of E-unifier pieces are considered. 


In the following lemmas, assume t and s are terms with compatible head symbols, i.e., either one is 
a variable and the other is not, or they are both non-variables with roots constrained by the same 
sub-theory. Let y be the combined preserving substitution for t and s, i.e., yf=t, yS= s and By) C 
TUF)UVS). E.-unify will be the sub-theory unification algorithm for the sub-theory constraining the 


head symbols of t or s (one or both depending on whether they are both non-variables or not). 


Lemma 12: 
If &* is confined then: 


ot =0s => o°yf= o°yYSs 
E Y E; ¢ 
Proof. 


ot = os => ot E as, by Lemma 10 
=>ot os, by Lemma 8 
=> oyf = oyS, by Proposition 1 
=> oy te o°y S, by Lemma 9 
i 


0 


This proves the existence of an E -unifier of the homogenized terms for any E-unifier of the 
unhomogenized terms, and just as important, gives a way of constructing the E.-unifier from the 
E-unifier. This is the key to showing that our approach of dividing E-unification problems into several 
E.-unification problems is complete. We still need to show that the manner in which E-unifiers are 


constructed from the E,-unifiers in the algorithm is complete. 


Recall that in CR-unify the E-unifiers are combined through unification of substitutions with the 
preserving substitution. One property sufficient to show completeness would be that any E-unifier 
unifies the preserving substitution and the constructed E.-unifier. However, this property does not 


hold in general for the following reason: some of the variables in the range of the sub-theory unifier 
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may be elements of U, whereas none of the variables in the range of y can be in U, and furthermore, o 


does not contain any variables from U in its domain, so o will not unify the two substitutions. 


Instead of the property in Figure 3-1, we show a weaker but still sufficient result which states that 
o is part of a substitution that unifies p and y. The substitution p will solve the problem of the 
variables in J(p) being in U since it maps every variable in U to some term containing no variables in 


U. 


Lemma 13 shows that the substitution oop is an E-unifier of y and the constructed p; and Figure 
3-4 gives the pictorial representation of the proof. The proof of Lemma 13 starts from the result in 
Lemma 11, just as the left-hand diagram of Figure 3-4 shows two instances of Figure 11. Similarly, the 
right-hand diagram represents the last step in the proof of Lemma 13, namely the statement of the 
Lemma. Note that the right-hand diagram of Figure 3-4 is almost identical to the diagrams in Figure 
3-1, our original goal for a correctness diagram; the only difference is that the o arrows in Figure 


3-1 are o°y arrows in Figure 3-4. 


—_—_—__1___» 
Y Y tt ey 
eaters erent 
= p oop 
o o ’ 
Se 
oop 
as ee oa : 


Figure 3-4: A Diagram of the Completeness Lemma 


Lemma 13: Ifo is an E-unifier of tand s, and p= 5° Vlg oy) then: 
o° pop eet 


Proof. Show the right-hand diagram of Figure 3-4 frollows from the left-hand diagram. 


pe(a°y) = (o°y)°H, from Lemma 11 
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=> 0°2°(0°y) = g°a°y°y. by applying @ 
=>a°e(o°y) S08 yep. by idempotence 
= o°yen. since UC V 
=> 6°1°(0°y) = o°y. since L(y) = U 


=> o°n°%(a°y) = aepey. since J(y)NU = Sand F(w)N-7U = S 


E 
=> o°p°p = o°pey, since T(a°y)-F(p)GU 


=> oouep = aopey. since (F(yJUS(p)NU = © 


3.3.3 A Proof of Completeness 


This section proves the completeness theorem for CAR-unify. The proof works in the reverse 
direction from the CA-unify algorithm, showing that any “good” answer was made from pieces that 
must be considered by CR-unify. We assume the input terms have a unifier, and by observing some 
properties of the unifier, show that either it or a more general unifier will be produced by CA-unify. 
The proof may also be somewhat contusing because the result seems too weak. At every step we are 
showing the there is some partially formed unifier that is more general than any actual unifier, but we 
never show that the pieces form a unifier. Recall, however, that this condition was proved separately 


in the proof of consistency, Theorem Ill. 


One of the non-obvious steps in the proof is that the minimal set of E.-unifiers is sufficiently large 
to find a complete set of E-unifiers. Note that this does not follow directly from the Lemma in Section 
3.3.2, since they only show that the E-unifier can be constructed (through substitution unification) 
from some E-unifier and not that it can be constructed from a minimal E.-unifier. This is important to 
the completeness argument since the E,-unification algorithms are only assumed to return minimal 


complete sets of unifiers. 


Theorem IV: Ifo is an E-unifier of t and s, then there exists @ € CR-unify(t, s) such that @ : o. 
Proof. By induction on the depth of recursion in CR-unify. Proving the induction step will require a 
second induction on the number of calls to CR-unify made from map-unify. 
1. The basis for induction on the depth of recursion are those cases in CR-unify for which 
map-unify is not called, namely, cases 1 and 4 of CA-unify and case A and C of 


CR-variable-unify under either 2 or 3 of CR-unity. 


A. Case 1: t and s are both variables. Any unifier of t and s is an instance of {tes}. 
B. Case A: y is a variable not occurring ins. Any unifier is an instance of {vy +s}. 


C. Case C: y is a variable occurring in s and in 3(y). Therefore, v is a variable in some 
non-variable proper significant subterm of s, and by Theorem II, v and s are not unifi- 


able. The empty set is a complete set of unifiers. 


D. Case 4: t and s are non-variable terms such that t.head#s.head. By the Theorem |, t and 
T 


s are not unifiable. Again, the empty set is complete. 


2. The inductive step includes cases 5 of CR-unify and case B of CA-variable-unify when 

called from either 2 or 3 of CR-unify. 
A. Case 5: t and s are non-variables such that t.head =s.head. Let E* be the sub-theory of 
E* constraining t.head and s.head and let F be die oat of function symbols constrained 


by E*. 


We will first show that the substitution p constructed in Lemma 12 bears a useful relation 


to some sub-theory unifier found in CR-unify. 


Let p= 0° Y lg igoyy 


From Lemma 12 we know: 
pf = p&. 


By the completeness of E,-unify, there exists p' CE.-unify(f, $) such that 
, Vv “a 
p “4 p, for V, = MAUS). 


{ 
Therefore, 4 such that pep’ 1p and since E* is a sub-theory of E*, 
res 
P°P =P: 
Without loss of generality, assume 
B(p)CDpUI(p'). 


By protectiveness of p’ we know 
(S(p)US(p)US(pJUS(p')(V,-V,) = . 
Let V5 = TMthUs) and V4 = V, UV, and from the previous step we have 
° Va 
P°P = p. 
Using these values of p, p’, and y, we can show the the inductive step for case 5, i.e., 


; ' V ta : 
that there exists @€map-unify(p, y), such that @ <2. By examination of map-unify 
E 
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we know that @ would have to be a substitution of the form WO. Wo, where n is the 
size of S(p')US(y) and W; is the unifier of the i” pair of terms. Let g, be the accumu- 
lated unifier for the i” iteration, 1.e., GF, = WOW, ,0..° Wo. Show by induction on i that 
Vv 
a. £2. 
Ne 


. : Vv 
i) Basis: If i=0, then Oy = Wy =. Therefore, Do <2g. (Note: n = O would cor- 
E 


respond to both t and s being homogeneous and equal modulo E.) 


ii) Inductive step: If 0, then let vy be the i'” variable considered in $(p')UD(y) for 


forming corresponding pairs of terms. 


: ; Vy 
Show there exists w,€ CR-unity(o, ,p'v;, 0, ,yv;) such that 0, = w.°o, , 4 oO. 


Oo. 


Vv : : : 
e = a, by the induction hypothesis on i 


Vv : 
a, = oop, since D(p)NS(o) = O&B(p)NV, = 


Jy such that yo, , teas 1, by definition of 2 
o°p°p = ooey, from Lemma 13 
o°LL°p 23 0°mey 
o°pepep’ 2 ony, since pop’ 3 p was shown above 
Ag’ such that y'°copep' 230°p°y, since Vx ES(p), x € D(p’) and: 
a) If x €V,, then x €5(a): 
HOpopX = Mpx 
= pox 
= Ox. 


b) If x €V,-V,, then x €S(y): 
x €B(u) & x ED(o). 


c) If x €V,V,UV,, then x is a new variable from p’: 
x €D(p) & x €B(o). 


Using this new substitution, p’, we have: 
‘ U ‘ Va ‘ 
Pop rocpep =Spo%pry, 
' 1 Vat 
cea i aa ee 
‘ ? Vv Ul . . 
Pp Ya. ,°p = p ovo, ,°y, by replacing oop with yea; 


Therefore, p'y is an E-unifier of t= 0, ,p'v, and s.= 0, ,yv,. 
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By the incucticn hypothesis on ft and s, there exists a substitution, w, € 
CR-unify(t.s ). such that: 

Vv, 

w, $4 gq oy, where V4= WU)JUNs)). 

E 

Since D(q_) contains variables from V, only if they are in F(a), 
y°o N10. 
E 


Furthermore, wco, 


miAc 


, =| ¢°¥ea,,, since by protectiveness of CR-unify, 


x EF (w)AS(o) only if x EV, 
We sig WEY 
We also have g °¥°a, , Ae °g =e 0s oop. 


Vv : : ; 
Therefore. D,=W°O | $1! o, which which completes the induction proof oni. 
E 


Vv ; : 

The induction on | shows a. <2, which proves the inductive step on t and s in case 5 
E 

and therefore proves the existence of some @ produced by CA-unify(t, s) such that 


6 og. 
E 


B. Case B: The proof follows Case 5, except » replaces both t and f. 


3.4 Proof of Termination 


If recursive calls from CR-unify were made only to subterms of the original inputs, then termination 
would be obvious. However. at each iteration within map-unify, the substitution accumulated up to a 
given point is applied to the next corresponding pair of terms, so the terms of a recursive call are not 
necessarily subterms of the origina! inputs, and may be larger than the inputs. The proof of termina- 
tion in Section 3.4 uses noetherian induction; we define a noetherian ordering on terms which is 
proved to be strictly decreasing with each level of recursion. The ordering is a generalization of the 
ordering used by Fages to show termination of AC-unification [Fages 84]. The proof of termination is 
a generalization of the termination proof for AC-unification [Fages 84]. We consistently extend his 
definitions to handle the general case for unification in equationa! theories. With minor exceptions, 
our definitions would be identical to Fages’ if we restricted ours to only the associative-commutative 
and empty theories. Noetherian induction is discussed briefly in Section 3.4.1. Section 
3.4.2 describes the noetherian ordering that is the basis of our termination proof and then gives some 


lemmas on the ordering. 
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3.4.1 Noetherian Induction 


Classical induction is based on a total ordering, typically the "less than" ordering on the natural 
numbers. Noetherian induction is more general in that it is based on a partial ordering; the partial 
ordering is additionally required to have no infinite decreasing paths. The reader is referred to [Cohn 
65] for a justification of noetherian induction and to [Huet 80b] for some abstract properties of order- 


ings. 


A typical class of noetherian orderings that are not total are those formed as lexicographic exten- 
sions of two or more total orderings. In fact any lexicographic extension of noetherian orderings is 
itself a noetherian ordering. We will use this fact in the definition of the noetherian ordering for our 


termination proof. 


3.4.2 A Noetherian Ordering for E-Unification 


The input to CR-unify is a pair of terms, and we define a noetherian ordering on pairs of terms to 
perform the induction. The partial ordering on terms defined by the subterm property is not accept- 
able as a basis for our induction because, as mentioned, recursive calls are made to terms formed by 
applying substitutions to subterms of the inputs and not just to subterms of the original input terms. 
Therefore, a recursive call from CR-unify is not necessarily made to arguments that are strictly less 
than the inputs by the subterm ordering. We instead define an ordering that is contained in the 
subterm ordering, but does not increase when certain substitutions are applied to inputs. This order- 
ing is shown to decrease with each level of recursion. It is a lexicographic extension of two orderings 


on the size of sets r and v, which are defined below. 


Because each E.-unification procedure is assumed to terminate, it is is appropriate for the current 
discussion to think of the unification of two homogeneous terms from the same sub-theory as being a 
single computation. Expanding on this idea, it is more difficult to unify a term with many in- 
homogeneous subterms than one that is close to being homogenous, even if the more homogeneous 
term has a larger actual size. This should help motivate the definition of the first measure of com- 


plexity based on the set of distinct terms having strict and significant occurrences in the input terms. 


Definition. r(t,s) = {r|ris non-variable & (rSyt or rSys)}. 


The intuition behind the second complexity measure is more difficult, but is related to 7 in the 
following sense: If a variable occurs under more than one operator set, where by operator set we 
mean an equivalence class of F, then any substitution of a non-variable term for the variables will 
result in at least one new strict significant occurrence in the resulting term. Therefore, v is the set of 


variables occurring under more than one operator set int ands. 
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Definition. Let v(t, s) = {x €V | size(ParSets(x , t) U ParSets(x,s)) > 1} 


We will measure the complexity of a particular unification problem by considering the cardinalities 
of v and +, and use this measure of complexity to define an ordering on the pairs of terms comprising 
the inputs. The ordering will be denoted ~<e and is defined as the lexicographic extension of the 


cardinalities of » and 7: 


Definition. <t', s> ~< <t, 8> if and only if: 


1. size(v(t’, s')) $ size(v(t, s)) 
2. and, (size(v(t', s')) = size(v(t, s)) => size(r(t’, s’)) < size(r(t, s))). 


We will use the notation <¢ to denote a reflexive ordering containing ~e, i.€., <t', S><e<t, s> if and 
only if: 


1.<t', s><ett, > 


2. or, (size(v(t', s')) = size(v(t, s)) & size(r(t’, s')) = size(r(t, ))). 


Note that the t and s can be commuted in the ordering, since r(t, s) = r(s, t) and p(t, s) = v(s, f). 
This is consistent with our expectations of a good measure for the complexity of unification since 
unification is itself commutative. Lemma 14 states that ~<e is a noetherian ordering. This will allow us 


to use it as the ordering for an inductive proof. 


Lemma 14: The ~<e ordering is noetherian. 
Proof. It is the lexicographic extension of two instances of the less than total ordering on the natural 


numbers. oO 


3.4.3 Some Properties of the Ordering 


The inductive hypothesis within the proof of termination can only be applied to pairs of terms that 
are strictly smaller (in this case by the ~<e ordering) than the given pair of terms. We will use the 
following lemmas to show that recursive calls are made to strictly smaller terms than the input terms. 
Lemmas 15 and 16 give independent conditions that are each sufficient for one pair of terms to be 
less than another by the ~<¢ ordering. Lemma 15 shows that two non-variable proper significant 
subterms of two terms have strictly smaller complexity than the two terms; it will apply in case 5 of 


CR-unify where the algorithm recurses over non-variable arguments. 


Lemma 15: Lett’, and s:be non-variable proper significant subterms of t or s, then: 
<t', s> ~<e <t, >. 
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Proof. Lety, = #(f.s).r, = (hs)ry = v(t. s') and T= r(t', s'). Since t’ and s’ are subterms of t or 
s. all variables occurrences in! or s’ correspond to some occurrence in t or s. Since t’ and s’ are 
non-variab'e terms. all variables int’ or s’ will occur under an operator they occurred under int or s, 
)ss:ze(e ri It is now sufficient to show TCT); which in this case is independent 


sO rCr, and size(v, 


of whether or not Bev, 
All significant subterms of t’ or s’ are significant in t or s (because ft’ and s’ are themselves significant 
intors). Therefore. finding one significant subterm of t or s that is not a subterm of t’ or s’ will prove 
T,C7, and thus size(7,) < size(r,). Specifically, we will show that either t is not a subterm of t’ and not 
a subterm of s’ or s is not a subterm of t’ and not a subterms of s’ and, each being a significant 


subterm of itself. satisfies these conditions to prove size(t 5) < size(r,). 


Assume. without loss of generality. that t'<t. We also know that either s'<t or s'~s: 


1. If s'<?, then t is not a subterm of s’ and t is not a subterm of t’. 

2. If s <‘s, then there are three cases: 
A. If's’~<t' then s'<t. sot is not a subterm of s’ and tis not a subterm of 1’, 
B. If t'~s‘ then t'~‘s, so s is not a subterm of s’ and s is not a subterms of f’. 


C. If t's’ and s'-4t'. then both t and s are not subterms of t’ or of s’. 


Therefore, <t’, s> ~<e <td. Oo 


Lemma 16 gives sufficient conditions for a pair of subterms of two terms to be strictly less than the 
two terms even when one of the subterms is a variable. It will apply in case B of CR-variable-unify 


where the algorithm recurses over a variable and a non-variable term. 


Lemma 16: Lets’ be anon-variable proper significant subterm of t or s and let x be a variable such 
that size(ParSets{x , t)UParSets(x, s))> 1, then: 

<x, S> <e <t, 8? 
Proof. Lety,=v(t,s), 7, = 7(t,s), Py =v(x, s'), and T, = (x, s’). Since s’ is a subterm of t or s, all 
variable occurrences ins’ will occur under the same operator sets in either t or s and since x will be in 
pv, its new occurrence under {n} will not place any new variables in v. Therefore, v,Cr, and 


size(v,) $ size(v,). By the same argument as in Lemma 15, ToCT,, and thus <x, s> ~<e <t, S?. Oj 


Lemmas 15 and 16 will not be enough to show that the complexity decreases with recursion, since 
map-unify applies the accumulated unifier to the next corresponding pair of terms before calling 
CR-unify. In general, applying a substitution to a term may increase its complexity. Therefore, the 
following set of definitions and lemmas are used to give sufficient conditions on a substitution and 


pair of terms such that the substitution will not increase the complexity of the pair of terms. 
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Each substitution created directly by the CR-unify procedure is of the form {x « r}, where x is a 
variable and r is aterm. A substitution of this form has a domain of size one or zero and will be called 
an elementary substitution. The following definition gives sufficient conditions for forming elementary 
substitutions for a pair of terms to assure that the complexity of the terms will not increase with 


application of the substitution. 


Definition. Let o be an elementary substitution. o is said to be elementary non-increasing for t and 
s if and only if it is of one of the following forms: 
1. {t « s}, where t is a variable not occurring in s, or similarly, the substitution {s « t} where s is 


a variable not occurring int. 
2. {x « r}, where x €1(r) and (ParSets(x, t)UParSets(x , s))N(ParSets(r, t)UParSets(r, s)) # @ 


3. {x © »}, where either y is a new variable or 
(ParSets(x , t)UParSets(x, s))M(ParSets(y, t)UParSets(y, s)) # © 


4. {x © r}, where ris a non-variable homogeneous term such that x ¢ Yr) and 
[r.head]€ParSets(x, t)UParSets(x,s) and V y € Yr) either y is a new variable or 
[r.head]€ ParSets(y, )UParSets(y, s) 


5. {t © r}, where r is a non-variable homogeneous term such that t€¥(r) and 
[s.head]€ParSets(t, s) and r.head =s.head and Vy €Y(r) either y is a new variable or 
T 
[r.head]€ ParSets(y, t)UParSets(y, s). (Similarly for t and s reversed.) 


6. {x « r}, where ris a non-variable homogeneous term such that x ¢ Yr) and there exists a 
significant subterm, r’, of either t or s such that r.head =r'.head and 
(ParSets(x , t)\UParSets(x, s))M(ParSets(r'’, NUParSets(/’, s)) # @ and 
[r' head]€ParSets(x, r') and V y €V(r) either y is a new variable or 
[r.head]€ParSets(y, t)UParSets(y, s). 


Lemma 17 states that each of the conditions in the definition of elementary non-increasing sub- 
stitutions is sufficient to guarantee that application of such a substitution will not cause the com- 
plexity of the terms to increase by the ~<e ordering. 

Lemma 17: Ifo is an elementary non-increasing substitution for t and s, then: 

<ot," oS><ekt, >. 
Proof. Lety, = v(t.s),7, = r(t,s), v, = v(ot,as), and 7, = t(ot,os). Consider each case from the 
definition of elementary non-increasing substitutions: 


1. In this case ot = as=s and since tf was not in v4, we know Yoav, and To=T, which implies 


their cardinalities are also pairwise equal. 
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2. 


oO 


Since r is a subterm of either t or s, »,Cv,. If v,=v,, then all occurrences of x must have 
been under operators from a single sub-theory (i.e., size(ParSets({x, t)UParSets(x, s)) = 1). 
Furthermore, r must occur under an operator from this set since r and x have some parent 
set in common. Therefore, the new occurrences of rin of or os will be significant if and only 


if there was a significant occurrence of r in t or s and, thus, TCT. 


If » is a new variable, then o is simply a variable renaming, so size(v,) = size(v,). Otherwise, 
there is a common parent set of x and y, so size(v,) $ size(v,). In either case, 


size(r,) s size(t,). 


. Let» €V(r). If.y is a new variable, then all occurrences of y in ot or os are in r and, since r is 


homogeneous, size(ParSets(y, t)UParSets(y, s))= 1. Otherwise, there is a common parent 
set of y and x, so y will occur under symbols from more than one sub-theory in of or os 
only if either x or » did in t or s. Therefore, size(v 4) s size(v,). If size(v,) = size(v,), then x 
must have occurred under only one parent set and this set is [r.head]. Therefore, any 
occurrences of r in of or os will not be significant and by homogeneity of r, none of its 


subterms will be significant, so size(r,) s size(r,). 


.t occurs under {n} and [s.head], so size(ParSets(t, t)UParSets(t, s)) > 1. All other variables 


in r are either new, and by homogeneity of r have only one parent set, or already occur 


under [r.head] in t or s. Therefore, size(v,) < size(v,). 


. x occurs in t or s under both [r’.head] and the set of symbols distinct from [r'.head] under 


which r’ occurs to make it significant. Therefore, x occurs under more than one set of 
symbols in t or s and not at all in of. All other variables in r occur under multiple parent sets 


in ot or as only if they did in t or s by the argument in case (5), so v, = v-{x}. 


| 


We extend the definition of non-increasing to general substitutions as well as elementary ones by 


considering a composition of elementary non-increasing substitutions. 


a is said to be non-increasing for t and s. 


stitution to the given pair of terms does not increase their complexity. 


Definition. Ifo = o,°...°c, and a, is elementary non-increasing for o, ,...0,f and g; ,...0,8, in, then 


Notice that the definition of non-increasing reflects the way in which substitutions are built in 


Lemma 17 to substitutions that are not elementary. 
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map-unify; one unifier is found and applied to the inputs before next unifier is found. Lemma 
18 proves the desired property on non-increasing substitutions, i.e., applying a non-increasing sub- 


It extends the property in 


Lemma 18: If ois anon-increasing substitution fort and s, then: 
<ot, IS>SeCt, S>. 


Proof. By induction on n using Lemma 17. O 


Lemma 19: Lett’. s’ be subterms of t or s such that 

(ParSets(t’, t)UParSets(t’, s))V(ParSets(s’, t)UParSets(s’, s)) # @. If o is non-increasing for t' and s’, 
then o Is non-increasing fortand s. 

Proof. The proof is done by examining the elementary factors of o to show that each of the con- 
ditions that makes it non-increasing for t’ and s’ corresponds to some conditions that make it non- 
increasing fort and s. 


1. {t'« s'} corresponds to form (2) fort and s. 


2. {x © r}, where ris a significant subterm of t’ or s’ is still form (2) fort and s since 
ParSets(x, t')UParSets(x.s) C ParSets(x,t) U ParSets(x,s) and 
ParSets(r, t')UParSets(r, s) C ParSets(r, t) U ParSets(r, s). 


3 {x © y} fort’ and s’ is still form (3) for t and s 
ParSets(x, t )UParSets(v,s) C ParSets(x, 1) U ParSets(x,s) and 
ParSets(v, t)UParSets(s.s') C ParSets(y, t) U ParSets(y,s). 


4. {x ¢r}, where ris a homogeneous term is also form (4) for t and s since 
ParSets(x, t)UParSets(x,s') C ParSets(x,t) U ParSets(x,s) and 
Vy EVN?) if [r.head]€(ParSets(y ,t')UParSets(y ,s')), then 
[r.head]€ (ParSets(y ,t)UParSets(y,s)). 


5. {t « r}, where r is homogeneous in [s.head] is form (6) for tand s. The terms’ acts as /’ in 


form (6) of the definition. 


6. {x « r}, where r is homogeneous and there is an r’ as described, is still form (6), since all 
significant subterms of t' and s’ (including t’ and s’, themselves) are significant in t or s. 
oO 


The following technical definition will be used in our proof of termination to show that non-trivial 
recursive calls are made only to pairs of terms that are significant and appear under a common set of 
symbols in the input terms with accumulated partial unifiers applied. The property is not obvious in 
the proof since we are building the corresponding pairs of terms out of substitutions p and y, not 


directly from t and s. 


Definition. Let t and s be two homogeneous terms in F, the constrained symbols for a sub-theory of 
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E* and let p be a substitution that is homogeneous in F, Furthermore, assume either f or s is non- 


variable. A substitution, o, is said to be parent preserving for homogeneous t, s, and p, if and only if: 


(x EX & x EF(a)UI(a)) => FEParSets(ax, ot) U ParSets(ox, os), 
where X= M1P)UNS)USI(p). 


Informally, X contains the variables from t or s that were not cut off though homogenization, the 
new variables from homogenization, and any new variables from p. The intuition is that if one of the 
variables occurs in the range variables of o, then it must occur somewhere in ot or os, and, further- 
more, one of those occurrences must be under F. In addition, if one of the variables, x, occurs in the 
domain of a, then the range element ox must occur in of or os under F. It is called the parent 
preserving property because the variables in X all occur under F in either f, S$, pf or p$, and we want 


this parent relationship to be preserved, even when the unifier, o, is only partially formed. 


3.4.4 The Proof 


Theorem V is the termination theorem for the CR-unify procedure. The proof is by noetherian 
induction on the complexity of terms in the calls to CR-unify. The induction step is proved using a 


second induction on the number of calls to CR-unify made from a single invocation of map-unify. 


The proof is quite long and involved. Before presenting the proof in its entirety, we will give a 
short outline of the proof’s structure. This should give the reader a feeling for the purpose of each 
step in the proof and also act as a check-list for the things that have been proved and the things still 
left. The proof is by induction ont and s using the ~<e ordering. The proof uses a stronger induction 
hypothesis than the property of termination alone. in addition to termination, we prove that returned 
substitutions are non-increasing for t and s. 

Proof idea. 
1. Basis: Cases 1 and 4 of CR-unify and cases A and C of CR-variable-unify, as invoked in case 

2 or 3 of CA-unify. We need to prove that each step terminates and that the returned 


substitution is non-increasing for t and s. 


2. Inductive step: Case 5 of CR-unify and case B of CR-variable-unify (as called by case 2 or 3 
of CR-unify) are the inductive cases, since these two cases require recursive steps. These 
cases can be considered together, since both t and f for case 5 are v in case B. In this step 
we will show that calls to map-unify terminate. For simplicity, we treat the algorithms as if 
they always pick some element of a returned set of unifiers rather than exhaustively trying 
each element of the set. Since these sets of unifiers are finite, this simplification does not 


affect the soundness of our proof. 
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We will perform an induction on the variables in the domain of y and p. These variables are 
divided into two sets denoted V, and Vo. where Vy contains variables representing non- 
variable subterms of t or s and V, contains variables from the original t or s. The induction 


on these variables is divided into two parts, one for V, and one for Vp: 


In addition to proving the termination of each step and the non-increasing nature of unifiers, 
we will show that o. is parent preserving for f, §, and p. The proof that 0; is parent preserv- 
ing, will depend only on the induction on i, not on the induction on t and s since in the base 
cases p, f, and § are not defined. 

A. This step is for variables in Vv, 


i) Basis: i= 0, the first variable in the V): 
ii) Inductive step: i>0, the rest of the variables in V,. 


Show that this step terminates and any substitution formed is both non-increasing 
+4 


fort and s and parent preserving for f, §, and p. 


B. This step is for variables in Vo: 


i) Basis: Since V, and Vo are processed in order, this is simply the last case in Vy. 


ii) Inductive step: For the variables in V, 


Show that this step terminates and, again, that any substitution formed is both non- 
increasing for t and s and parent preserving for f, $, and p. We break this step 
into five cases depending on the value of s,, the term formed from p, and t,, the 


term formed from y. 


Given this rough outline, we now give the termination theorem with complete proof. 


Theorem V: For any terms, t and s, CR-unify(t, s) terminates. 
Proof. By induction ont and s using the ~<e ordering. 

1. Basis: Cases 1 and 4 of CR-unify and cases A and C of CR-variable-unify, as invoked in case 
2 or 3 of CR-unify. In each case termination is obvious; We will prove that a returned 
substitution, a, is non-increasing for t and s. 

A. Case 1: o= {tes} is non-increasing because it is either the identity substitution or of 


form (1) in the definition of elementary non-increasing. 


B. Case A: o = {tes} is of form (1) in the definition of elementary non-increasing. 
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C. Case C The empty set of substitutions is returned. so the properties are vacuously true. 
D. Case 4: Again. the empty set of substitutions is returned. 


_ Inductive step: Case 5 of CA-unify and case B of CR-variable-unify (as called by case 2 or 3 
of CR-unr',) are the inductive cases. Let E* be the sub-theory of interest and F be the set of 


function symbols constrained by E*,i.e., F = [s.heaa]. 
E _CR-unify terminates by the basic assumption on sub-theory unification procedures. 


Map-unity is called with y. the preserving substitution for t and s, and p, an E.-unifier. Induct 
over i, the number of iterations in map-unify, and for each i use the following notation: 
v EF(y)UT(p) 
Le Bear 
oO ON 
w €CR unify(t, s) 
GO, = W°O., 
Using this notation, we know a; is of the form: 
O, = WOW, 19... ° Wo. 
The set of domain variables is divided into M, and Vo: we will consider each case separately, 
first performing induction on i where v EV, and then where v.€V,. We show that o; is 
parent preserving for f, S, and p. 
A. V.cV, = 4(p)-d(y): We know by construction of V, thatyv; = vy. In this case, each 
unifier is of the form w, = {v,¢o, py}. 
i) Basis: If i=0, then no cails to CR-unify have been made and termination is obvious. 
The only substitution is Og =h which is trivially non-increasing and parent preserv- 
ing. 


ii} Inductive step: D0. 


OG, = W, 40. Wo and by the induction hypothesis on i, the w’s are all elementary 


i-1 
substitutions of the form w, = {vo, pv}. Therefore, Blo, )ELY -?} and, fur- 


thermore, v,€5(a; ,), SOt =o, ,YV,=9, 49,7). 


Also, by construction of V, we know v €5(p), which implies v, €3(p) and thus 
v€5(o, ,). Therefore, v€No, ,py)). 


Since t, does not occur in s. = o,pv,, the unifier is w, = {+s} ={v,«o, , pv, ,}, which is 


case A of CR-variable-unify, and obviously a terminating case. 
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w,, is homogeneous in F and elementary non-increasing for o, ,t and a, ,s because it 
is of form (3), (4), or (5) in the definition of elementary non-increasing. There- 


fore, g, is non-increasing fort and s by Lemma 18. 


v, occurs under F in either t or s, and a, is homogeneous in F, so the parent preserv- 
ing hypothesis holds. (Note: this is true even if vis t, or s, as in case B of 
CR-variable-unify, since vy, must have another occurrence under F in the non- 


variable term.) 


B. v.€V, = Dy): 


i) Basis: The proof for v, in V, proves the base case for Vo: 


ii) Induction Step: 


Since v €S(y) we know yv. and thus t. is a non-variable such that the head symbol is 


not in F. Therefore, t. is a proper significant subterm of o, ,f or a, ,s. 


There are five cases to consider for 5) 
a) If s, iS a non-variable such that s,.head#t,.head, then the recursive call ter- 


minates with the empty set in case 4 of CR-unify. 


b) If s; is a variable such that set) & s.€3(Preserve(f., t)), then the termination 


halts with the empty set in case C of CR-variable-unify. 


c) If s, is a variable such that s€Nt) then the single substitution returned by 


CR-unify{t,, S:) is w, = {s,t,}. 


@; is of form (2) for gt and 0, 48 in the definition of elementary non- 
increasing, and since the induction hypothesis on i implies oi, is non- 


increasing fort and s, D,= 0°, is non-increasing for t and s. 


1 


Termination is immediate in the recursive call since t; and S; fit case A of 


CR-variable-unify as called from case 3 of CR-unify. 


To show the parent preserving hypothesis on a, we will consider two cases: 
the variables in the domain of o and the variables in the range of 0. 
Recall the variables of interest, X, are either variables in t or s with occur- 
rences under F, new variables from homogenization, or new variables 


from E.-unify. 
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Consider x in B(o,), and show that o,x will occur under F in either of or 
g, ,8. This is evident not because of any conditions of x , but because 
yy, occurred under F in t or s and therefore ox =s,= 9, ,YV; will occur 


under F in of oro. 


Let x be in J(o,) and show that x will occur under F in of or as. i 
x €J(a,) then either x €35(0; ,) or x EUs). Variables, other than those 
in S(w.), that are in J(o; ,) will occur under F by the induction 
hypothesis on i. Variables in S,= 90, ,yV, are either in ¢t ors or in J(a; ,). 
in the first case, if variables occur in t or s and in X, then they must 
have occurred under F in either t or s. In the second case, variables 

and X will occur under F in o, ,t or o, ,s by the induction 


in o,, 


hypothesis and thus under F in ot or as. 


d) If s is a non-variable such that s,.head = t.head, then: 


Since [t,.head]#F we know [s,.head]#F. But s;= 9, ,pv,, SO p maps y, to itself 
or to some other variable. In either case there is some variable, y€D(a, ,) 
and y€X such that Oy = §;,. Therefore, by the parent preserving 


property on a, 


,.41 §, occurs under F in either o, ,f oro, ,s. 


Therefore, t; and s; are both non-variables significant subterms of o, ,f or 


g, ,S and by Lemma 15, <t,, 5>~<@ <a; jt, 0, 5. 


By the inductive hypothesis on i, o, , is non-increasing for t and s and thus 


<o, 4f, G, Met, s> by Lemma 18. 


By transitivity, <t, S>~<eXt, s>. lLe., the recursive call is made to strictly 
smaller inputs than the original inputs, and we can apply the noetherian 


induction hypothesis to prove termination. : 


By the induction hypothesis on t, and s., if w; is a returned substitution, then 
w; is non-increasing for t, and s,, and by Lemma 19 for t and s since t, and 


s both occur under F. 


The parent preserving property is proved by showing that if x ED(o)JUI(a,) 
then 9,X OCCUTS under F in of OF as. Since O,= WOO, 4, x is either in 
Bo, ,)UI(o, 5) or in B(w)UI(w,). In the first case, the parent preserving 
property in a; follows directly from the induction hypothesis on Gi 4 In the 


second case, if x is in B(w,) then it must be in f, ors, and if x is in J(w,) 
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then it must be int. sor it must be a new variable and therefore not be in 
X. We tnerefore need only consider the variables in ¢, and s.. The vari- 
ables in ? are either in 5(y). and thus in f or s. or they are in J(o, 4): the 
variables in s. are all in 3(o, ,). All variables in J(a, ,) are covered by the 
induction hypothesis, and variables in t or s occur in X only if they occur 


under F, so w, and hence o, are parent preserving. 


e) ifs, is a variable such that s.€i1t,) and s.€5(Preserve(F., t), then: 
Since s.= 9, ,py,, is notin f or s but is int, we know s.€5(o, 4). 


t€iis) => t€3(o, ,°p). Therefore, either t€5(p) or there exists some v€5(p) 


such that =o, Jy. 


in either case, the inductive hypothesis applies to give the parent preserving 


property on a , and s-(i) occurs under F. 


But, s, also occurs under [t.head]. Therefore, size(ParSets(s, o, ,t)U 


ParSets(s. 0, ,s))> 1 and by Lemma 16, <t,, s>~<<o, ,t. 0, ,5?. 


By the same arguments as in case 4, <t. $< <t, s> and we can use the 
inductive hypotheses to show O,=W°O , is parent preserving and non- 


increasing fort and s. 
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Chapter Four 


Conclusions 


This chapter summarizes the main contributions of this thesis and suggests areas for future work. 
We look specifically at possible extensions of CR-unify and some issues and related to the complexity 


and efficiency of our approach. 


4.1 Contributions 


In this thesis we defined a generalization to the equational unification problem which we called 
combined unification. The problem is to take a set of equational theories, each of which has a known 
equational unification algorithm, and automatically produce an algorithm for the union of the theories. 
The problem solved here, where we restricted ourselves to confined regular theories, was mentioned 
as an open problem by [Shostak 84] as a generalization to his work in combining decision 
procedures. It is also described as an open problem in[Siekmann 84], where it is called the 


combination of theories problem. 


The combined unification problem was motivated both by a theoretical interest in learning more 
about equational unification and a pragmatic interest in using equational unification in larger systems 
such as theorem provers. Many applications of unification involve reasoning about an arbitrary sets 
of operators with different set of equational properties. For example, in an automatic theorem prover 
the set of needed equational theories will depend on the theorem to be proved. Moreover, given the 
difficulty of designing equational unification algorithms, it is not reasonable to assume the algorithm 
will be designed “on the fly," while proving the theorem. These considerations lead us to conclude 
two things: systems based on equational unification must have a large set of built-in theories, and, this 
set must be easily extendible so that new theories can be incorporated as they are deemed inter- 


esting. 


Our approach was to describe a unification algorithm for equational theories that are the union of 


theories having known unification algorithms. The main results of this thesis are: 


- a careful definition of the combined unification problem, 


- characterization of a sub-problem of combined unification for which a solution was pos- 
sible, and for which extensions outside the sub-problem domain are difficult, 
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- design of an algorithm for combining equational unification algorithms for confined 
reguiar theories, 


- proofs of consistency, completeness, and termination for the algorithm, 


-a method for a lifting the variable-only case to the case with free symbols and multiple 
instances using our combining algorithm, 


- an implementation of the algorithm. 


Our definition of the combined unification problem is based on the existence of what we called a 
partitioned presentation of the theory. A partitioning on the presentation characterizes the indepen- 
dence of its sub-theories, i.e., the theory presented by each of the partitions. This intuitive notion of 
independent sub-theories corresponds to having disjoint sets of constrained symbols for each sub- 
theory. Independence is essential to the ability to automatically combine unification algorithms as 
evidenced by the fact that combining three theories with decidable unification problems, namely 
associativity, left distributivity and right distributivity, results in a theory with an undecidable unifica- 


tion problem. (See Figure 1-4 for references.) 


The sub-problem of combined unification that we chose to solve was combining confined regular 
theories. Both of these properties give sufficient conditions for finding pairs of terms that are not 
unifiable. We defined a confined theory to be one in which there are no equations with a variable 
equal to a non-variable term. This gave us the invariant that any two equal non-variable terms have 
head symbols constrained by the same sub-theory. A regular theory is a theory in which the right and 
left side of each equation contains the same set of variables. Together with the restriction to confined 
theories, the regularity restriction guarantees that every pair of equal terms has equal sets of sig- 


nificant subterms. 


We also showed in Section 2.4 that confinement and regularity were not restrictions of con- 
venience, but that both our algorithm and a number of simple extensions to it were not correct for 
unconfined or non-regular theories. Taken collectively, the examples in Section 2.4 point out a 
conflict between the goals of completeness and termination of E-unification procedures. In many 
cases it was possible to either prove completeness assuming termination or to prove termination of an 


incomplete procedure. 


The examples in Section 2.4 also serve as counter-examples to the correctness of some existing 
E-unification algorithms when more than one instance or free symbols are allowed. The abelian 
group unification algorithm and the algorithm for AC unification with either idempotence or unit are 


examples of algorithms that are not correct in the more general cases. The ACI and ACU theories 
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have variable-only case algerithms defined in [Livesey 76]. [Fages 83b] shows termination for the 
genera! cases of AC! and ACU but neither his algorithm nor the algorithms in [Livesey 76] are com- 
plete in these cases. in particular. they do not find a unifier when unifying a variable with a non- 
variable term containing the variable. For the same reason. the algorithm for abelian group unifica- 


tion [Lankford 84] is incomplete in the general cases. 


Our algorithm is a combining algorithm for the confined regular theories. It automatically builds 
an algorithm for the combined unification problem by dividing the combined problem into a number of 
variable-only problems, invoking algorithms for the sub-theories on those problems, and recombining 
the answers. A homogenizing function is used for the problem division and unification of substitu- 
tions used for the recombining. The technique requires no redesign of the sub-theory algorithms, and 
no theory-specific computation in the generalized algorithm. The restrictions of regularity and con- 


finement are implicitly used for correctness, but are never explicitly tested by the algorithm. 


The practical significance of our results depend upon the existence of confined regular theories 
for which the combined unification problem was previously unsolved, and for which the theories 
describe "interesting" properties. The permutative theories are examples of such theories. The 
insert operation on the set data type. for example. has the property of right commutativity, a permuta- 


tive property that also has known unification algorithm. 


An algorithm for unifying permutative axioms is useful in term rewriting system completion 
procedures and in resolution, because both of these procedures make use of classical unification and 
are limited by the necessity of maintaining termination. The permutative axioms are a class that will 
always lead to termination problems, since they can be applied repeatedly. The non-permutative 
axioms can often be handled directly by the application, such as resolution or term rewriting, without 
loosing termination. {tis often the case that equations leading to termination probiems in an applica- 
tion can be handled by combined unification, while the equations leading to problems in combined 
unification can be handled by the application. Referring again to the set example, we note that most 
axioms for this data type can be handled in a term rewriting system by placing them in the rules. An 
exception is the right commutativity of insert, which would violate properties necessary for termina- 
tion. Right commutativity can be handled by our CR-unify algorithm, and is allowed in the unification 
algorithm in an equational term rewriting system. We therefore have a method for proving theorems 
about sets of terms or sets of sets by incorporating the combining unification algorithm with right 


commutative unification into an equational term rewriting system. 


Another class of theories we have discussed are those theories presented by only ground equa- 
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tions. These theories have an automatically generated unification algorithm through narrowing 
{Hullot 80], and are also confined and regular. While the case with free symbols and multiple in- 
stance case are solved by the narrowing process, our algorithm provides a method for combining 


these narrowing unification algorithms with arbitrary other algorithms for confined regular theories. 


A proof of total correctness for the CR-unify algorithm was given in Chapter 3. The proof was 
presented in three separate theorems: Theorem lil shows consistency, Theorem IV completeness, 
and Theorem V termination. The consistency and termination properties are proved for all theories, 
while completeness is proved for the regular confined theories. The consistency and completeness 
proofs assume termination, and therefore constitute a partial completeness argument. Combined 
with the termination proof this shows total correctness. The termination proof was based on Fages’s 
proof of termination for AC-unification. However, the extension was non-trivial, particularly because 
case B of CA-unify is a failure case in the AC theory, whereas case B is a recursive case in CR-unify to 


handle theories presented by ground equations. 


A consequence of our method and the proofs is that any variable-only case algorithm for an 
equational theory E can be automatically lifted to both the multiple instance case and case with free 
symbols. The importance of this is demonstrated by considering an example. A unification algorithm 
for the AC theory was originally described by Stickel in 1975, and although the procedure was pur- 
ported to solve the general cases for AC, termination in the general case was unproven until 1983, 
when it was solved by Fages. The AC theory is one example of a confined and regular theory; Fages’s 
generalization of Stickel’s algorithm is a special case of our CR-unify just as his termination proof is a 
special case of our termination proof for CR-unify. The unification problem with free symbols is also 
immediately solved by our algorithm for any theory with a variable-only case algorithm, because the 


unconstrained symbols are handled by the empty theory unification algorithm. 


The ability to go directly from a variable-only case solution to the more general solutions also give 
us some bounds on the number of unifiers in certain theories. As a corollary to our proof of total 
correctness, termination in the variable-only case of a confined and regular theory implies termination 
in both the multiple instance case and the case with free symbols of this theory. This in turn implies 
the existence of a finite complete set of unifiers in these more general cases. Conversely, if either the 
case with free symbols or the multiple instance case is known to be infinite for a given theory, then the 


variable-only case must also be infinite. 


The CR-unify algorithm has been implemented as part a general effort to extend the REVE term 


rewriting system generator [Lescanne 83, Forgaard 84b] to equational term rewriting systems 
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{Kirchner 84b]. The implementation supports the generalized unification algorithm and allows for 
simple modular extension to new sub-theories as their unification algorithms are implemented. In the 
current version, the unification algorithms for the AC and empty theories have been implemented. 
The implementation of the REVE system, including our unification algorithm, was done in CLU [Liskov 


81]. 


For efficiency reasons, the implementation differs from the description given in this thesis. Each 
E-unification procedure is implemented to perform unification in a single equational theory, making 
no assumptions about the properties of the operators in the subterms, but recursively calling the 
top-level general unification procedure to unify subterms rather than returning homogeneous sub- 
stitutions and combining them through unification of substitutions. This eliminates the overhead of 
forming the homogeneous terms. The sets of substitutions that appear in both CR-unify and 
map-unify are replaced in the implementation with CLU iterators. This simplifies the code and may 
allow for combining non-terminating unification procedures, such as that for the associative theory 
[Plotkin 72]. Finally, as in Fages’s implementation of AC-unification, the order of recursion imposed 
in map-unify by forming V, and Vo is not preserved. l.e., variable elements in &(y) are not necessarily 


unified first. The assumption on the order of recursion is made only to simplify the termination proof. 


4.2 Future Work 


The work in this thesis has suggest some areas for further research. We will discuss, on a 
pragmatic level, some techniques for improving the efficiency of our algorithm. We also consider the 
problem of weakening the restrictions on the equational theories allowed in the combining algorithm, 


and present directions for further work in this area. 


4.2.1 Efficiency Issues - 

The feasibility of using E-unification in applications will depend in part on the ability to find 
reasonably efficient algorithms for performing the unification. Although the combined unification 
problems are inherently hard for many interesting theories, there are a number of optimizations that 


will improve the running time in practice. 


One of these optimizations comes from the difference between U-Homog and Homog, defined in 
this thesis. CR-unify is still correct if U-Homog or other homogenizing functions are used in place of 
Homog in the procedure. There is a trade-off between the efficiency of the homogenizing function 


and the number of unifiers of the homogenized terms, since terms with multiple occurrences of 
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variables have fewer unifiers than terms in which each variable is unique. The best homogenizing 


function may be one that performs differently depending on the theory of the head symbols. 


The order of recursive calls is also very important to the running time of combined unification. In 
practice it is best to perform the simplest recursive calls first. especially when they will lead to failure. 
Ordering of recursive calls can be done partially on the basis of the kind of terms to be unified (i.e., 
variable versus non-variable) and partially on the basis of the relative difficulty of performing a unifica- 


tion in the different theories [Fages 85]. 


Perhaps the most interesting class of optimizations would involve weakening our strict boundaries 
between sub-theory unification algorithms. There are sufficient conditions on terms for non- 
unifiability that can be checked very quickly, e.g.. clash of head symbols. Sometimes, a sub-theory 
unification algorithm can make use of this information during its processing and thereby never 
produce unifiers that would require two obviously non-unifiable terms to be unified. A clash between 
symbols from different sub-theories could be detected in the current structure of the algorithm, but 
more general kinds of checks for non-unifiability are specific to a theory and would therefore requires 
sharing information between unification algorithms. Our current implementation does not make use 
of this kind of information because the emphasis was on correctness and modularity of the program 


rather than efficiency. 


A measure of complexity that exists for unification problems, and is also related to the efficiency in 
practice, is minimality of a solution. An algorithm that is totally correct but produces many non- 
minimal unifiers will be too inefficient for some applications. Minimality is probably too expensive to 
require of unification algorithms, since in some theories this would require an exponential filtering 
process. However, non-minimal unifiers affect the execution time of both the algorithm and its ap- 
plications and thus one measure of a good algorithm should be that it produces few non-minimal 
unifiers. A related open problem is to find a minimal combining unification algorithm that avoids 


exponential filtering. 


Efforts to gain significant improvements in efficiency through parallel processing are limited by 
some lower bounds in that area. First, note that E-unification problems for which the best algorithm is 
exponential will have at best exponential paralle! algorithms, since we have only a polynomial number 
of processors. For theories in which polynomial unification algorithms exist, improvements through 
paralle! processing may be possible. However, the fact that empty theory unification is inherently 


sequential [Dwork 84] is not promising. 


4.2.2 Removing Restrictions on the Theories 


A challenging area of research that is not addressed in this thesis is the problem of combining 
theories in which constrained symbols are not disjoint. Some of the negative results on the 
decidability of unification problems indicate the difficulty of this problem. The undecidability results in 
this area have been based on the undecidability of Hilbert’s 10" problem, solving Diophantine equa- 
tions over the integers. which was shown undecidable by Matiyasevit [Davis 73]. [Arnborg 85] and 
{Szabo 78] investigate combinations of theories with non-disjoint function symbols by studying the 
lattice of sub-theories that are consistent with Peano arithmetic. [Szabo 78] shows the undecidability 
of the associative theory with two sided distributivity, AD, while [Arnborg 85] shows the undecidability 
of the associative theory with one-sided distributivity and a right and left identity element, ADU or 
AD_U. In both cases it was also shown that any theory consistent with Peano arithmetic and contain- 


ing the AD or AD,U theory, respectively, also has an undecidable unification problem. 


The CR-unify described in this thesis cannot handle non-terminating procedures for enumerating 
unifiers. For the sake of notational convenience, the CR-unify invokes a sub-theory algorithm which 
returns a complete set of unifiers, although processing of unifiers could be done one at atime. it may 
be possible to get a combining algorithm for non-terminating unification procedures by processing on 
sub-theory unifier before the others are generated. The interesting problem here is to show that the 


resulting procedure is a complete generating procedure for the combined theory. 


An obvious problem that is left open in this thesis is combining unification algorithms when the 
theories may be either unconfined or non-regular. The problems that arise in trying to extend 
CR-unify were characterized by the examples in Section 2.4. We found, in general, that it was not 
hard to guarantee consistency, but that the properties of termination and completeness seem to 
conflict. It was possible to get a provably complete procedure if one assumed termination, while in 
actuality the procedure would loop in a trivial manner before any unifiers were generated. Alter- 
natively, a terminating algorithm could be achieved, but it was found that the algorithm was incom- 


plete on some non-trivial examples. 


Before our algorithm can be extended to unconfined or non-regular theories, further theoretical 
work must be done. The experience gained in this work shows that the problem of generalizing 
unification procedures is not trivial, that seemingly obvious approaches are not always correct, and, 
therefore, that algorithms in this field require detailed descriptions and careful proofs of consistency, 


completeness, and termination. 


Appendix A 


Protection of Variables in CR-unify 


A technical issue that we have avoided discussing in detail until now is the generation and protec- 
tion of new variables in CA-unify. A unification algorithm is often used in a larger system, and that 
system may have variables of its own, and it is important that any new variables generated by CR-unify 
do not coincide with those existing externally. This problem could be handled by simply renaming 
variables after performing unification, but the more general problem comes up within the CR-unify 
algorithm because of the recursion. It is important to all three correctness properties, i.e., consis- 
tency, completeness and termination, that new variables generated on recursive calls do not coincide 
with those existing in subterms not involved in the recursion. For example, if two new variables 
appear together and both use the same name, the resulting unifier may be less general than intended 
substitution. In particular, the property of idempotence of unifiers, used in the proofs, depends on the 
disjointness of domain and range variables. This disjointness can only be guaranteed if the variables 
in the domain of one factor of a substitution can be protected from appearing in the range of another 
factor. The parent preserving property of the termination proof also depends on the protectiveness of 


recursive calls to CR-unify. 


In the implementation the protection problem corresponds to the problem of generating globally 
unique identifiers from within any local procedure environment. The solution in the implementation is 
to pass an object for generating unique identifiers to each unification procedure and to guarantee a 
priori that all variables in the input are disjoint from any variables that may be generated. We do this 
by picking a special prefix for generated variables and concatenating a unique integer whenever a 
new variable is needed. In the formal context, it is more convenient to pass the set of variables to be 
protected than to pass a function for generating identifiers, although the two approaches are effec- 


tively the same. 


We begin by imposing a total order on the universe of variables, V. If V is any set of variables, 
Next(V) denotes the smallest variable in V, as defined by the imposed ordering. The homogenizing 
function, Homog, is then modified to incorporate the protected set of variables. The following 
modification to the definition of Homog will legitimize calling Homog a function, since it will now be 


mathematically well-defined. The definition given here replaces the earlier one. 


Definition. Let F be a set of function symbols. W be a finite set of variables, and t be a term such that 
4} CW. Homeg(t, F, W) is defined as follows: 


1. If tis a variable, then Homog({t, F,) = t. 


2.ift = f(t,.....t,) and f € F, then 
Homog(t. F. W) = f(Homog(t,. F, W,).--.Homog(t_, F, W_)), where W, = Wand for i>, 
W=W.,U VHomogit, ,. F.W. ,)). 


Figures 4-1 and 4-2 show the CR-unify and map-unify procedures, respectively. For the sake of 
consistency between Figure 2-3 and 4-1, we will abuse our notation slightly. The shorthand form of 
the homogenizing function, f, was used in the less formal description of CR-unify in Figure 2-3. 
Because it does not allow for specification of the the set of protected variables, we will use the 
longhand form, Homeg, and use f and § as identifiers in the code; they represent the same values as 


in Figure 2-3. 


For simplicity we assume the set of protected variables, W, contains all variables in the other 
inputs arguments. Le. {t)Us) C W in calls to the CR-unify procedure, M{s)U{v} € W in 
CR-variable-unify, and Fg JUS GP )UT(p)UI(p,) CW in map-unify. 
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CR-unify = proc (t, s: term, W:var_set) returns (subst_set) 
case 
is_variable(t) and is_variable(s) => % case 1 
return({{tes}}) 
is_variable(t) and ~is_variable(s) => % case 2 
return(CR-variable-unify(t, s, W)) 
is_variable(s) and ~is_variable(t) => % case 3 
return(CR-variable-unify(s, t, W)) 
t.head * s.head => % case 4 
retu rn (@) 
t.head = s.head => % case 5 
ae Homog(t, t.head, W) 
:= Homog(s, t.head, WUY1f). 
:= Preserve(t, f) U Preserve(s, $) 
= Eneagy UNV (EF, §, WUNANUNS)) 
separ unin y, WUNTNUNS)UI(p)) 


retu (Zan UNs)) 
end 


~ 


Max 


end 
end CR-unify 
CR-variable-unify = proc (v: variable, s: term, W:var_set) returns (subst_set) 
$:= Homog(s, s.head, W) 
y := Preserve(s, 8) 
case 
v € Ys) => % case A 
return ({{v © s}}) 
v € Vs) & v € Hy) => % case B 
P:= Es heaay Uity (v, 8, WUNS)) 
Lis Pap nity y, WUNS)UI(p)) 
retu MZ sure) 
vy € Ws) & v € Hy) => % case C 
return({}) 
end 
end CR-variable-unify 


Figure 4-1: A Careful Description of the CR-unify Procedure 


map-unify = proc Co P:Subst, W:var_set) returns(subst_set) 
V,:= S(p,)-Bg,) 
V2i= Hg,) 
Z:= {i} 
i:=0 
Wo := W 
forj = 1to2do 


for yin Vj do 
i:=i+1 
Z:= {w°o, ,|o,,€ 2, &W, = WU, ,) 
& w.€CR-unity(o, .p,V, 5. 4PoVs W.)} 
end 
retu rn(z.) 
end map-unify 


Figure 4-2: A Careful Description of the map-unify Procedure 


We will use an inductive argument to show the protectiveness of CR-unify. Recall that the con- 


ditions comprising protectiveness of a set = of unifiers of t and s are: 
Vo€X B(o)CV & W-VNI(o) = S 
&D(o)NI(a) = D 


where V={t)UMs). The protectiveness of each step depends on W containing all variables in the 


input arguments, so it is important that recursive calls maintain this convention. 


The applications of Homog to t and s in both CR-unify and CR-variable-unify of Figure 4-1 meet 
this requirement, since 1(t)UY(s)CW. Calls to the sub-theory unification algorithm case 5 and case B 
explicitly add any new variables from f and § to W. The only non-trivial case is the invocation of 
map-unify, and here we know by construction of y that all variables in $(y)US(y) are in either Yi), 
Vs), Wf, or (8). Furthermore, by protectiveness of the sub-theory unification algorithm, 
SpCNhUNS). By explicitly adding J(p) as well as 1(f) and (8) to W, we are guaranteed to cover all 
variables in both y and p. Note that the protected variables in calls to one invocation of map-unify do 


not contain variables generated from a previous invocation of map-unify. 


Map-unify also maintains the convention of having all variables from the arguments in the 
protected set, since it explicitly places any new variables from one invocation into the protected set 


for the next invocation. 


. Given this assumption on W, we can see that CR-unily produces a protective set of unifiers. 
-Casés 1, 2,3, 4, A, and C are obvious. in cases 5 and 6, #9 is a retuned unifier, then S(o)C TUNE) 
by the explicit restriction of 3. The; two properties on the range of 6, (W-V)Nc) = @ and 
Fto)N5(o) = @ follow because « is the composition of substitutions formed by recursive calis to 
 CR-unify, and in each case W is a subset of the protected set in the recursive cal. 


Appendix B 


Glossary of Terms 


A-assignment a mapping from terms to elements of algebra A. 

algebra A set of elements and a set of function on the elements. 

carrier The set of elements in an algebra. 

clash In classical unification, this is the problem that occurs when trying to unify terms 


with different head symbols. 


congruence relation 
An equivalence relation closed under the equality rule, be S stl 8 SP 


L(tynt J~S(S ,.--8,) for all f €F of arity n. 


complete set of unifiers 
A generating set for the set of ail unifiers. 


completeness The property on unification algorithms that guarantees a complete set of unifiers 
is always found. 


confined A set of equations is confined if is contains no equations with a variable equal to a 
non-variable term. 


consistency The property on unification algorithms that guarantees all returned substitutions 
are unifiers. 
constants Function symbols of arity 0, denoted a, b,c, d,0,or1. 


corresponding pair The pair of terms with which CR-unify is invoked from within map-unify. The pair 
is formed by picking a variable in the domain of the two substitutions, applying 
each substitution to the variable, and then applying any previously accumulated 
substitution. 


cycle The problem that occurs in classical unification when unifying a variable with a 
term containing that variable. More generally in equational unification, this hap- 


pens whenever the unification is an infinite term. 


domain The domain of a substitution, denoted 3(a) is the set of all variables mapped to 
something other than themselves, i.e., D(a) = {vlov #v}. 


elementary substitution 
Substitution with a domain of size 1 or 0. 


91 


elementary non-increasing 


ground terms 
head 


homogeneous 


Homog 


instance 


instantiation 
map-unify 


match 


A fairly technical definition giving sufficient conditions on elementary substitu- 
tions such that the complexity (<e) of two terms is not increased by applying the 
substitution. 


The set of terms formable from only function symbols, i.e., no variables. 

The leftmost symbol of a term. 

A term is homogeneous with respect to a set of function symbols is the term 
contains no function symbols outside that set. A substitution is homogeneous if 
all terms in its range are homogeneous with respect some set of function symbols. 


(See Homog, U-Homog, and U-HomogMap.) 


The homogenizing operation on terms, denoted (t), that replaces subterms with 
new variables. 


A term t is an instance of s is t= os for some substitution o. Similarly, a substitu- 
tion, , is an instance of @, if and only if there exists p, such that p,=P,°Py-. Po 
is said to be more general than q, in this case, and the partial order on substitu- 
tions is denoted p, $ 9,. 

A rule of inference used in equational logic From tes deduce ot aes, 


A procedure for finding unifiers of substitutions. 


A substitution mapping a term to an instance of itself. 


minimal complete set of unifiers 


minimal partition 


minimality 


An set of substitutions that generates all unifiers and contains no redundant sub- 
stitutions. 


The smallest partition on a presentation of a theory that preserves disjointness of 
operators. 


A property on unification algorithms that guarantees no returned substitution is an 
instance of another returned substitution. 


more general modulo E 


A partial order on substitutions, denoted <, that is similar to < except = is used 
in place of term equality. 5 


most general unifier 


non-cenfining 


non-increasing 


In classical unification this is the unique unifier of which all other unifiers are 
instances. In E-unification there may be a set of most general unifiers. 


An equation is non-confining if it is of the form v =t or t= v, where t a non-variable 
term. 


A substitution formed of only elementary non-increasing factors. 
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occurrence A of integers, 0, denoting node within a term. For example, S(tyrent, 1.0 =t,70, 
where t/o denotes the term at occurrence o int. 


parent A parent of tin s is an operator in s having t as an argument. The set of all parents 
of tins is written Parents(t,s). 


parent set The equivalence of a parent. The set of all parents sets of t in s is written 
ParSets({t,s). 


partitioned presentation 
A set of sets of axioms with pairwise disjoint operators. 


presentation A set of axioms for an equational theory. 


preserving substitution 
The substitution mapping f to t, which is just the match of fby t. 


proper occurrence An occurrence other than the empty occurrence, e 


protection A property on unification algorithms limiting the variables that can appear in an 
answer, i.e., all answers must be protective unifiers. 


protective unifier A unifier, o, of t and s is protective if is domain and range contain disjoint sets of 
variables, the domain is a subset of the variables in t and s, and any variables in 
the range of o than do not occur in t or s are new variables. 


quotient algebra The algebra formed from another algebra by taking equivalence classes of ele- 
ments of the second as elements of the first. For example, J; hs denotes the term 
algebra modulo an equational theory E*. 


range The range of a substitution is the set of terms to which some element of the 
domain is mapped. 


regular An equation is regular if the right and left sides contain identical sets of variables. 
A set of equations is regular if every element is regular. 


relevant function symbols 
The set of function symbols constrained by a sub-theory. In most cases this is the 
set of function symbols in a presentation of the theory given in a; in the case of 
the empty theory, this is the set of symbols that do not appear in any axioms. 


significant occurrence 
An occurrence, o is significant if the occurrence just above it does not have the 
same operator as o, where same means =. All variable occurrences as well as the 
empty occurrence are significant. 


significant subterm A subterm is significant if it occurs at a significant occurrence. 


strict occurrence A non-variable occurrence. 


strictly consistent A theory is strictly consistent if and only if x =r is notin the theory. A theory that 
iS not Strictly consistent contains all equations. 


strict theory A theory is stnct if for any set of unificands having a unifier in common, the 
transitive closure of ~<j, iS a strict ordering on the set. 


strongly complete theory 
A theory is strongly complete if for any pair of terms, a variable, x, and non- 
variable, t, if v and t are unifiable then there is complete set of unifiers such that 
every substitution in the set has a domain of {x}. 

sub-theory if E is an element of 7, then E* is a sub-theory of E*. 


substitution A mapping from variables to terms extended to a mapping from terms to terms. 


subterm ordering Denoted t-<s, this partial ordering on terms holds when t is a subterm of s. 


theory A set of equations that is closed under rules of inference. 

U-Homog A homogenizing function on terms that replaces subterms with elements of U 
rather than new variables as in Homog. fis used when the value of F is clear from 
context. 

U-HomogMap A homogenizing function on substitutions, denoted o, that homogenizes each 


term in the range and maps some elements of U to others. 


unifier in the classical case, a substitution, a. is a unifier of two terms t and s if and only if 
ot=oas. In the more general equational unification, term equality if replace by 
equality in an equational theory. 


CR-unify An algorithm for unifying in confined regular theories. 
uninterpreted Function symbols that do not appear in a presentation of a theory and unin- 
terpreted. 


universal E-preserving substitution 
The substitution, y, that maps the homogeneous form of a term under U-Homog to 
a term within E of the original. 


variables The universe of variables is denoted V while individual variables are denoted by 
eitheru,v,w,x,y,orz. 


1A 


mil 


E* 


AW(E) 


Appendix C 


Special Symbols 


An algebra. 


The noetherian ordering on inputs to CR-unify used to show termination. <e is 
the lexicographic extension of the cardinalities of »y and +. 


The domain of a substitution, i.e., (0) = {vljov #v}. 


Used to denote the ordering on substitutions, , < p, if and only if there exists p, 
such that 9.°P, = Pp: 


The congruence relation on terms defined by an equational theory of E. Also used 
to denote the congruence relation extended to substitutions. 


The equational theory presented by the set of axioms, E. |.e., E denotes any set of 
equations whereas E* denotes a closed set. 


The set of equations valid in all models of the class M. 
The universe of function symbols, i.e., the signature of the entire theory E*. 
The set of function symbols in t 


A substitution, usually used to denote the preserving substitution of a term and its 
homogeneous form or the combined preserving substitution for a pair of terms. 


The set of all ground terms formable from F. 

The empty substitution. 

The function symbol at the head (or root) of t. 

The empty occurrence, i.e., the empty string. For any term t/e =f. 


The set of all variables in the range of a substitution, i.e., J(o)={v] vEVMt) for 
t€R(a)}. 


An ordering on pairs of terms defined in [Kirchner 85] that is used in defining a 
strict equational theory. One pair is less than another if a term in the first pair is a 


variable, and that variable occurs in a non-variable term in the second pair. 


The set of all models of E. 
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p(t, s) 


O(t) 


Parents(t, $s) 
ParSets(t, s) 


nN 


git 


The universal! E-preserving substitution. 

An A-assignment, or interpretation, mapping terms to objects in an algebra. 

The null operator, used to denote the parent of the a term in itself. 

The set of variables in t and s that occur under more than one equivalence class 
of parent operators. The cardinality of » is denoted by v. (vy is in no way related to 
the use of » as an A-assignment.) 

The set of special variables used to denoted congruence classes of terms. 


The set of occurrences int 


A substitution, used in this thesis to denote factors of a unifier as it is being built in 
CR-unify. 


The set of parent operators of tins. 
The equivalence classes of parent operator of tins. 
The partitioned presentation, usually assumed to present the theory E*. 


The equivalence relation on function symbols defined by the partitioned presen- 
tation, 7. 


A substitution. 


A substitution, used in this thesis to denote a sub-theory unifier of two 
homogeneous terms. 


A substitution, used the CR-unify to denote a unifier or, when subscripted, a par- 
tially formed unifier. 


The terms in the range of a substitution, i.e., ®(o) = {av]v €D(o)}. 
The set of all possible substitutions. 
The set of significant subterms of f and s. The cardinality of + is denoted by r. 


The set of all terms formable from F and V. 


The quotient algebra of the term algebra modulo and equation theory congruence 
relation on terms. 


The universe of variables. 
The set of variables in t. 


Restricts the domain of a substitution: of, = {v¢ov | v€V}. Also used to restrict 
sets of substitutions: 2], = {o], | o€ 2}. 
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| 


LA 


Functional composition. For substitutions p, and ¢,, p,°p,t = P,(Pot) for any 
term f. 


The function symbols at the head of a term t. 

Validity. 

Set of all E-unifiers of two terms. 

Complete set of unifiers. 

Minimal and complete set of unifiers 

The equivalence class of function symbols (defined by 2) that contains /. 

The homogeneous form of t. Homog(t, [t-head}). Homogenization is done with 
respect to the set of relevant function symbols for some sub-theory of E* such 
that the head of t is in the set. Le., itis the maximum homogeneous term at the top 


of t where new variables take the place of subterms outside the homogeneous 
part. 


The homogeneous form of t which is similar to f except subterms are replaced 
with elements of the special set U rather than new variables. 


The subterm ordering on terms. |.e., t<s if and only if tis a subterm of s. t<s may 
be used if tis a proper subterm of s. 


The significant subterm ordering, i.e., tps if and only if tis asubterm of s andtis 
significant in s. if tis also know to be proper in s, t<ys may be used. 
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